CRYSTAL STRUCTURE OF A DEACETYLASE AND INHIBITORS THEREOF

This application claims priority of U.S. Provisional 
Application No. 60/152,753, filed September 8, 1999, the 
contents of which are hereby incorporated by reference. 

This invention has been made with government support under 
National Institutes of Health Grant No. R01 CA-65698.
Accordingly, the U.S. Government may have certain rights in 
the invention. 

Throughout this application, various publications are
referenced by author, date and citation. The disclosures of
these publications in their entireties are hereby
incorporated by reference into this application in order to
more fully describe the state of the art as known to those
skilled therein as of the date of the invention described
and claimed herein.

Introduction 

The present invention relates to a histone deacetylase
homologue from the hyperthermophilic bacterium Aquifex
aeolicus, HDLP (histone deacetylase-like protein; also known
as AcuC1), which shares 35.2% sequence identity with human
histone deacetylase (HDAC1) and can be co-crystallized
with an inhibitory ligand, and more particularly to the
detailed crystallographic data obtained from said
co-crystallization, which are disclosed herein. The invention
also relates to methods of using the crystal structure and
X-ray crystallographic coordinates of the apo-HDLP and
inhibitor-bound HDLP to design, isolate and screen compounds
which bind to and inhibit the active site of HDLP and
HDLP-related proteins, such as those proteins belonging to the
HDAC family, including HDAC1.

Background of the Invention 

The reversible modification of histones by acetylation is
associated with changes in nucleosome conformation and
chromatin structure, and plays an important role in the
regulation of gene expression (reviewed in Davie and Chadee,
1998, J. Cell. Biochem. Suppl. 30-31:203-213). The histone
acetylase and deacetylase enzymes that carry out these
modifications are involved in many cellular processes, such
as cell cycle progression and differentiation, and their
deregulation is associated with several types of human
cancer (reviewed in Kouzarides, 1999, Curr. Opin. Genet.
Dev. 9:40-48; Hassig et al., 1997, Chem. Biol. 4:783-789;
Fenrick and Hiebert, 1998, J. Cell. Biochem. Suppl.
30-31:194-202).

Recently, several experimental antitumor compounds, such as
trichostatin A (TSA), trapoxin, suberoylanilide hydroxamic
acid (SAHA), and phenylbutyrate, have been shown to act, at
least in part, by inhibiting histone deacetylases. Richon et
al., 1998, Proc. Natl. Acad. Sci. USA 95:3003-3007; Yoshida
et al., 1990, J. Biol. Chem. 265:17174-17179; Kijima et al.,
1993, J. Biol. Chem. 268:22429-22435. Additionally, diallyl
sulfide and related molecules (Lea et al., 1999, Int. J.
Oncol. 15:347-352), oxamflatin (Kim et al., 1999, Oncogene
18:2461-2470), MS-27-275, a synthetic benzamide derivative
(Saito et al., 1999, Proc. Natl. Acad. Sci. USA 96:4592-4597),
butyrate derivatives (Lea and Tulsyan, 1995, Anticancer Res.
15:879-883), FR901228 (Nakajima et al., 1998, Exp. Cell Res.
241:126-133), depudecin (Kwon et al., 1998, Proc. Natl.
Acad. Sci. USA 95:3356-3361) and m-carboxycinnamic acid
bishydroxamide (CBHA; Richon et al., 1998, Proc. Natl. Acad.
Sci. USA 95:3003-3007) have been shown to inhibit histone
deacetylases. In vitro, these compounds can inhibit the
growth of fibroblast cells by causing cell cycle arrest in
the G1 and G2 phases (Richon et al., 1996, Proc. Natl. Acad.
Sci. USA 93:5705-5708; Kim et al., 1999, Oncogene
18:2461-2470; Yoshida et al., 1995, Bioessays 17:423-430;
Yoshida & Beppu, 1988, Exp. Cell Res. 177:122-131), and can
lead to the terminal differentiation and loss of
transforming potential of a variety of transformed cell
lines. Richon et al., 1996, Proc. Natl. Acad. Sci. USA
93:5705-5708; Kim et al., 1999, Oncogene 18:2461-2470;
Yoshida et al., 1987, Cancer Res. 47:3688-3691. In vivo,
phenylbutyrate is effective in the treatment of acute
promyelocytic leukemia in conjunction with retinoic acid.
Warrell et al., 1998, J. Natl. Cancer Inst. 90:1621-1625.
SAHA is effective in preventing the formation of mammary
tumors in rats, and lung tumors in mice. Desai et al.,
1999, Proc. AACR 40: abstract #2396; Cohen et al., Cancer
Res., submitted.

Histone deacetylases catalyze the removal of acetyl groups
from the ε-amino groups of lysine residues clustered near
the N-terminus of nucleosomal histones, and this process is
associated with transcriptional repression (reviewed in
Struhl, 1998, Genes Dev. 12:599-606). Deletion of the yeast
histone deacetylase gene, rpd3, or its pharmacological
inactivation with trichostatin A reduces the transcriptional
repression in a subset of promoters, such as those of
Ume6-regulated genes. Kadosh & Struhl, 1998, Mol. Cell.
Biol. 18:5121-5127. This is accompanied by the increased
acetylation of H4 histones in the repressed promoter and its
vicinity, but has no effect on histones at promoter-distal
regions. Kadosh & Struhl, 1998, Mol. Cell. Biol.
18:5121-5127; Rundlett et al., 1998, Nature 392:831-835.

Histone deacetylases are recruited to specific promoters by
associating with DNA-binding transcriptional repressors,
either directly or through co-repressors that bridge the
deacetylase to the transcriptional repressors. For example,
the Mad and Ume6 repressors bind to the co-repressor Sin3A
(Laherty et al., 1997, Cell 89:349-356; Hassig et al., 1997,
Cell 89:341-347; Kadosh & Struhl, 1997, Cell 89:365-371),
and the nuclear receptors bind N-CoR and the related SMRT
co-repressors. Nagy et al., 1997, Cell 89:373-380; Alland
et al., 1997, Nature 387:49-55; Heinzel et al., 1997, Nature
387:43-48.

The deregulation of histone deacetylase recruitment appears
to be one of the mechanisms through which these enzymes
contribute to tumorigenesis. In acute promyelocytic
leukemia (APL), chromosomal translocations fuse the retinoic
acid receptor-α (RARα) to either PLZF or PML. These
fusion oncoproteins have aberrant transcriptional repression
activity resulting, in part, from the recruitment of a
co-repressor and, in turn, HDACs. Grignani et al., 1998,
Nature 391:815-818; Lin et al., 1998, Nature 391:811-814.
Treatment of PLZF-RARα APL cells with TSA enhances their
responsiveness to retinoic acid-induced differentiation.
Grignani et al., 1998, Nature 391:815-818; Lin et al., 1998,
Nature 391:811-814.

The histone deacetylases comprise a large family of
proteins, conserved from yeast to man, and are divided into
two related classes. Class I is characterized by human
HDAC1, 2, 3 (Taunton et al., 1996, Science 272:408-411; Yang
et al., 1996, Proc. Natl. Acad. Sci. USA 93:12845-12850;
Emiliani et al., 1998, Proc. Natl. Acad. Sci. USA
95:2795-2800), and yeast RPD3 (Vidal & Gaber, 1991, Mol.
Cell. Biol. 11:6317-6327), and class II by the human HDAC4,
5, 6 (Grozinger et al., 1999, Proc. Natl. Acad. Sci. USA
96:4868-4873; Fischle et al., 1999, J. Biol. Chem.
274:11713-11720), and yeast HDA1 (Rundlett et al., 1996,
Proc. Natl. Acad. Sci. USA 93:14503-14508). The two classes
share a ~390 amino acid region of sequence similarity,
comprising the deacetylase core, but are divergent outside
this region. The histone deacetylase genes belong to an even
larger superfamily (Leipe & Landsman, 1997, Nucleic Acids
Res. 25:3693-3697) that contains the prokaryotic acetoin
utilization proteins (AcuC; 28.1% sequence identity to
HDAC1), and the prokaryotic acetylpolyamine amidohydrolases
(APAH; 15.0% sequence identity to HDAC1). The enzymatic
activity of AcuC is not clear, but its disruption reduces
the ability of B. subtilis to break down acetoin and utilize
it as a carbon source. Grundy et al., 1993, Mol. Microbiol.
10:259-271. APAHs catalyze the deacetylation of polyamines
by cleaving a non-peptide amide bond (reviewed in Leipe &
Landsman, 1997, Nucleic Acids Res. 25:3693-3697).

It is useful to address the questions of how HDACs and
HDAC-related proteins catalyze the deacetylation of histones
and how the above-referenced compounds, particularly those
compounds with antitumor activity, inhibit this activity, in
order to better understand the mechanism of inhibition of
HDACs and to facilitate the discovery of additional useful
compounds which may inhibit this activity. To this end, the
present invention provides the three-dimensional
structure of an HDAC1-like protein from the thermophilic
bacterium Aquifex aeolicus, hereinafter HDLP. The
determination of the nucleic acid coding sequence of HDLP
was described by Deckert et al., 1998, Nature 392:353-358.
The encoded 375-residue protein, whose sequence was
determined from the nucleic acid encoding sequence, shares
35.2% amino acid sequence identity with HDAC1, deacetylates
histones in vitro, and is inhibited by TSA, SAHA and several
other HDAC inhibitors. The determination of the
three-dimensional structure of HDLP is useful in the design,
identification and screening of new HDAC family inhibitory
compounds which are useful for the inhibition of cell growth
both in vivo and in vitro.

Summary of the Invention 

In general, it is the object of the present invention to
provide detailed three-dimensional structural information
for a family of proteins known as histone deacetylases
(HDAC), and particularly a homologue from the
hyperthermophilic bacterium Aquifex aeolicus, HDLP (histone
deacetylase-like protein), which shares 35.2% sequence
identity with human histone deacetylase (HDAC1). It is also
an object of the present invention to provide
three-dimensional structural information of an HDLP bound to
an inhibitory compound.

In one embodiment of the invention, three-dimensional
structure information is obtained from a crystal of
wild-type HDLP (SEQ ID NO:1) (the nucleic acid encoding
wild-type HDLP is SEQ ID NO:2). In a further embodiment of
the invention, three-dimensional information is obtained
from a mutant HDLP comprising two mutations: (1) cysteine 75
to a serine and (2) cysteine 77 to a serine
(Cys75Ser/Cys77Ser double mutant; SEQ ID NO:3) (the nucleic
acid encoding HDLP Cys75Ser/Cys77Ser double mutant is SEQ ID
NO:4). The HDLP mutant of the present invention facilitates
the determination of three-dimensional structural
information of HDLP bound to a zinc atom at its zinc
atom-binding site.

In a preferred embodiment of the invention, the
three-dimensional structural information is obtained from a
co-crystal of a protein-inhibitor compound complex that
comprises HDLP or HDLP Cys75Ser/Cys77Ser double mutant and
trichostatin A (TSA). In another preferred embodiment of
the invention, the three-dimensional structural information
is obtained from a co-crystal of a protein-inhibitor
compound complex that comprises HDLP or HDLP
Cys75Ser/Cys77Ser double mutant and suberoylanilide
hydroxamic acid (SAHA). Any HDLP or HDLP-related protein
(e.g. HDAC) inhibitor compound that may be co-crystallized
with HDLP may be used to form a co-crystal of the present
invention.

The protein crystals and protein-inhibitory complex
co-crystals of the present invention diffract to a high
resolution limit at least equal to or greater than 4
angstroms (Å). In a preferred embodiment, the protein
crystals and protein-inhibitory complex co-crystals of the
present invention diffract to a high resolution limit of
greater than 2.5 Å.



A crystal of the present invention may take a variety of
forms, all of which are contemplated by the present
invention. In a preferred embodiment, the crystal has a
space group of C2 with one molecule in the asymmetric unit
and with unit cell dimensions of a = 51.4 Å, b = 93.8 Å, c =
78.7 Å and β = 96.9° (see, e.g., Example 2, below). In
another preferred embodiment, the crystal has a space group
of P2₁2₁2₁ with two molecules in the asymmetric unit and with
unit cell dimensions of a = 53.4 Å, b = 94.4 Å, c = 156.3 Å
(see, e.g., Example 2, below). The HDLP structure comprises
a parallel β sheet with α helices packing against both faces.
At one end of the β sheet, the HDLP has a narrow, tube-like
pocket formed by several well-ordered loops. The walls of
the pocket are lined with hydrophobic residues and there is
a zinc binding site and several polar side chains at the
bottom of the pocket. The inhibitory compounds of the
present invention bind in the pocket.

The three-dimensional structural information obtained from
crystals of HDLP, HDLP Cys75Ser/Cys77Ser double mutant, HDLP
Cys75Ser/Cys77Ser double mutant comprising a zinc atom, HDLP
comprising an inhibitory compound such as TSA or SAHA, and
HDLP Cys75Ser/Cys77Ser double mutant comprising an inhibitor
compound such as TSA or SAHA may be employed to solve the
structure of any HDLP-related protein (e.g. HDAC) crystal,
or any mutant HDLP-related protein, and particularly any
wild-type or mutant HDLP-related protein complexed with a
ligand, including a substrate or inhibitor compound. If the
crystals are in a different space group than the known
structure, molecular replacement may be employed to solve
the structure; if the crystals are in the same space
group, refinement and difference Fourier methods may be
employed. The structures of HDLP-related proteins (e.g.
HDAC1) exhibit no greater than a 2.0 Å root mean square
deviation (rmsd) in the positions of the Cα atoms for at
least 50% of the amino acids of the full-length HDLP
structure.
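
By way of illustration only, the rmsd criterion stated above may be checked as in the following Python sketch. It assumes that the two structures have already been superimposed and that their Cα atoms have been paired residue by residue; the function names, the pairing and the default thresholds (taken from the text: 2.0 Å, 50%, 375 residues) are illustrative assumptions and form no part of the disclosed method.

```python
import math


def ca_rmsd(coords_a, coords_b):
    """Root mean square deviation (in Å) between paired Cα positions.

    Both arguments are equal-length lists of (x, y, z) tuples that are
    assumed to be already superimposed and paired residue by residue.
    """
    if len(coords_a) != len(coords_b) or not coords_a:
        raise ValueError("need two non-empty, equal-length coordinate lists")
    total = 0.0
    for (xa, ya, za), (xb, yb, zb) in zip(coords_a, coords_b):
        total += (xa - xb) ** 2 + (ya - yb) ** 2 + (za - zb) ** 2
    return math.sqrt(total / len(coords_a))


def meets_rmsd_criterion(rmsd, n_paired, n_full_length=375,
                         max_rmsd=2.0, min_fraction=0.5):
    """True if rmsd <= max_rmsd over at least min_fraction of the residues."""
    return rmsd <= max_rmsd and n_paired / n_full_length >= min_fraction
```

In practice the superposition and the residue pairing would come from a structural alignment program; this sketch covers only the final arithmetic.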

The present invention also provides a nucleic acid molecule
encoding an HDLP Cys75Ser/Cys77Ser double mutant having the
amino acid sequence of SEQ ID NO:3 and the nucleic acid
sequence of SEQ ID NO:4. It is also contemplated by the
invention that mutations be made in HDLP-related proteins at
cysteine residues, as with the Cys75Ser/Cys77Ser double
mutant, in order to facilitate the determination of the
structure of said proteins bound to a zinc atom.
Additionally, the present invention provides expression
vectors which comprise the nucleic acid molecule encoding an
HDLP Cys75Ser/Cys77Ser double mutant, represented by
SEQ ID NO:4, operatively linked to expression
control sequences.

It is another object of the present invention to provide
methods for the design, identification and screening of
potential inhibitor compounds of the HDLP/HDAC family. In
a preferred embodiment, the method for the rational design,
identification and screening of potential inhibitor
compounds for HDLP and HDLP-related proteins (e.g. HDACs)
comprising deacetylase activity comprises the steps of: (a)
using a three-dimensional structure of an HDLP as defined by
the atomic coordinates of the present invention; (b)
employing said three-dimensional structure to design or
select said potential inhibitor compound; (c) synthesizing
and/or selecting said potential inhibitor; (d) contacting
said potential inhibitor compound with said enzyme in the
presence of acetylated substrate; and (e) determining the
percent inhibition of deacetylase activity to determine the
inhibitory activity of said potential inhibitor compound.
In a further preferred embodiment, the binding properties of
said rationally designed inhibitory compound may be
determined by a method comprising the steps of: (a) forming
a complex comprising said inhibitory compound and HDLP or an
HDLP-related protein; (b) co-crystallizing said inhibitory
compound-HDLP complex; (c) determining the
three-dimensional structure of said co-crystal through
molecular replacement or refinement and difference Fourier
methods with the molecular coordinates of HDLP as defined by
the present invention; and (d) analyzing the
three-dimensional structure to determine the binding
characteristics of said potential inhibitor compound.

It is a further object of the present invention to identify 
a defined class of HDLP/HDAC family inhibitor compounds. 
The HDLP/HDAC family inhibitor compounds of the present 
invention are represented by formula (I) : 



(I)




wherein X comprises a cap group which binds to at least one
amino acid selected from the group consisting of proline and
leucine; Y comprises an aliphatic chain group which binds to
at least one amino acid selected from the group consisting
of leucine, phenylalanine and glycine; and Z comprises an
active site binding group which binds to at least one amino
acid selected from the group consisting of aspartic acid,
tyrosine and histidine and may further bind to a zinc atom.



Brief Description of the Drawings

Figure 1 is a table listing the statistics from the X-ray
crystallographic analysis of an HDLP crystal, an HDLP-TSA
co-crystal, and an HDLP-SAHA co-crystal.

Figure 2 shows an alignment of various HDAC homologues with 
percent sequence identity depicted. 

Figure 3 shows a graph indicating the histone deacetylase
activity of HDLP and HDAC1 and the inhibition of HDLP and
HDAC1 by the inhibitors TSA and HC-toxin.

Figure 4 shows (A & B) a schematic representation of the
HDLP-Zn²⁺-TSA complex in two approximately orthogonal views,
(C) a topology diagram of HDLP indicating the regions of
homology with HDAC1, and (D) a close-up schematic
representation of the HDLP-Zn²⁺-SAHA complex.

Figure 5 shows (A) a schematic representation of a slice
through a surface representation of HDLP with the pocket,
internal cavities and position of the β sheet indicated, and
(B) a schematic representation of a close-up view of the
active site looking down into the pocket in an orientation
similar to Figure 4B.

Figure 6 shows (A) a space-filling representation of TSA in
the active site pocket, (B) a close-up stereo view of the
structure of the HDLP-Zn²⁺-TSA complex in a similar
orientation to Figure 4B, and (C) a schematic representation
of the HDLP-TSA interactions.



WO 01/18045 PCT/US00/24700

Figure 7 shows (A) a schematic representation of the regions
of homology shared between HDLP and HDAC1 in an orientation
similar to that of Figure 4A, and (B) a detailed schematic
representation of the homology shared in the pocket and
internal cavity between HDLP and HDAC1 in an orientation
similar to that of Figure 4B.

Figure 8 shows a schematic representation of the proposed
catalytic mechanism for the deacetylation of acetylated
lysine.

Figure 9 shows a schematic representation of a space filling 
diagram showing the conserved amino acids in the active site 
and nearby grooves. 

Figure 10 is the nucleic acid sequence of HDLP from Aquifex
aeolicus (SEQ ID NO. 2).

Figure 11 is the amino acid sequence of full-length HDLP
from Aquifex aeolicus (SEQ ID NO. 1).

Figure 12 is the nucleic acid sequence of the HDLP active
site mutant Tyr297Phe (SEQ ID NO. 6).

Figure 13 is the amino acid sequence of the HDLP active site
mutant Tyr297Phe (SEQ ID NO. 5).

Figure 14 is the nucleic acid sequence of a double mutant of
HDLP from Aquifex aeolicus comprising a Cys75Ser and
Cys77Ser mutation (SEQ ID NO. 4).

Figure 15 is the amino acid sequence of a double mutant of
HDLP from Aquifex aeolicus comprising a Cys75Ser and
Cys77Ser mutation (SEQ ID NO. 3).

Figures 16-1 to 16-49 list the atomic structure coordinates
for HDLP as derived by X-ray diffraction from a crystal of
HDLP.

Figures 17-1 to 17-49 list the atomic structure coordinates
for HDLP Cys75Ser/Cys77Ser double mutant comprising a zinc
atom in the active site as derived by X-ray diffraction from
a crystal of the HDLP Cys75Ser/Cys77Ser double mutant.

Figures 18-1 to 18-99 list the atomic structure coordinates
for HDLP Cys75Ser/Cys77Ser double mutant as derived by X-ray
diffraction from a co-crystal of HDLP complexed with TSA.

Figures 19-1 to 19-48 list the atomic structure coordinates
for HDLP Cys75Ser/Cys77Ser double mutant as derived by X-ray
diffraction from a co-crystal of HDLP complexed with SAHA.



Detailed Description of the Invention

The present invention provides crystals of a histone
deacetylase (HDAC) homologue grown in the presence and
absence of a compound capable of inhibiting the histone
deacetylase activity of said HDAC homologue. As referred to
herein, an HDAC homologue (as well as an HDLP-related
protein) is any protein molecule having (a) greater than 15%
sequence identity over the 375 amino acid residues of HDLP;
(b) no more than twenty insertions or deletions, for a
total of no more than 100 amino acids; and (c) deacetylase
activity. Sequence identity is calculated by the program
DNAstar™ using the identity matrix weighting scheme, Clustal
method (DNAstar program, Madison, WI).
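
By way of illustration only, criteria (a) and (b) above may be evaluated from a pairwise alignment as in the following Python sketch. It does not reproduce the DNAstar™ calculation; it assumes two pre-aligned, equal-length strings with "-" marking gaps, and it assumes that identity is expressed as a percentage of the reference (HDLP) residues. The function names are hypothetical.

```python
def percent_identity(ref_aln, query_aln):
    """Identical positions as a percentage of the reference residues.

    ref_aln and query_aln are pre-aligned strings of equal length in
    which '-' marks a gap. Assumption: the denominator is the number of
    non-gap residues in the reference (375 for full-length HDLP).
    """
    if len(ref_aln) != len(query_aln):
        raise ValueError("aligned sequences must have the same length")
    ref_len = sum(1 for r in ref_aln if r != "-")
    if ref_len == 0:
        raise ValueError("reference contains no residues")
    matches = sum(1 for r, q in zip(ref_aln, query_aln)
                  if r != "-" and r == q)
    return 100.0 * matches / ref_len


def gap_summary(ref_aln, query_aln):
    """Return (number of insertion/deletion runs, total gapped positions)."""
    runs = 0
    total = 0
    previous = None
    for r, q in zip(ref_aln, query_aln):
        if r == "-" and q != "-":
            state = "insertion"   # extra residue in the query
        elif q == "-" and r != "-":
            state = "deletion"    # residue missing from the query
        else:
            state = None
        if state is not None:
            total += 1
            if state != previous:
                runs += 1
        previous = state
    return runs, total
```

A candidate would satisfy (a) and (b) under these assumptions if `percent_identity` exceeds 15, the number of runs is at most 20 and the total gapped positions are at most 100; criterion (c) must still be established by assay.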



An HDLP/HDAC inhibitor compound, as used herein, refers to
any compound represented by Formula (I):

(I)

wherein X comprises a cap group which binds to at least one
amino acid selected from the group consisting of tyrosine,
proline and leucine; Y comprises an aliphatic chain group
from about 5 to about 10 Å, preferably 7 Å, which binds to
at least one amino acid selected from the group consisting
of phenylalanine and glycine; and Z comprises an active site
binding group which binds to at least one amino acid
selected from the group consisting of aspartic acid,
tyrosine and histidine and which may further bind to a zinc
atom. The HDAC inhibitory compounds of the present
invention can inhibit greater than 50% of the histone
deacetylase activity of an HDAC homologue or an HDLP-related
protein.
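
By way of illustration only, the "greater than 50%" threshold above may be computed from assay readings as in the following Python sketch. The formula is the conventional one relative to an uninhibited control; the function names, the optional background correction and the units of the readings are illustrative assumptions not taken from the specification.

```python
def percent_inhibition(activity_with_inhibitor, activity_control,
                       background=0.0):
    """Percent inhibition relative to an uninhibited control reaction.

    All readings are in the same arbitrary units (e.g. counts of
    released acetate). `background` is an optional no-enzyme reading
    subtracted from both values (an assumption of this sketch).
    """
    control = activity_control - background
    if control <= 0:
        raise ValueError("control activity must exceed background")
    inhibited = activity_with_inhibitor - background
    return 100.0 * (1.0 - inhibited / control)


def is_inhibitory(activity_with_inhibitor, activity_control,
                  background=0.0, threshold=50.0):
    """True if the compound inhibits more than `threshold` percent."""
    return percent_inhibition(activity_with_inhibitor, activity_control,
                              background) > threshold
```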

To grow the crystals of the present invention, the HDAC and
the HDAC-inhibitory compound complex are purified to greater
than 80% of total protein and more preferably purified to
greater than 90% of total protein. For expression and
purification purposes, the full-length HDLP (GenBank
accession number AE000719) may be subcloned from an Aquifex
aeolicus chromosomal DNA preparation by the polymerase chain
reaction (PCR) and inserted into an expression vector.



A large number of vector-host systems known in the art may
be used. Possible vectors include, but are not limited to,
plasmids or modified viruses, but the vector system must be
compatible with the host cell used. Examples of vectors
include E. coli bacteriophages such as lambda derivatives,
or plasmids such as pBR322 derivatives or pUC plasmid
derivatives, e.g., pGEX vectors (Amersham-Pharmacia,
Piscataway, New Jersey), pET vectors (Novagen, Madison, WI),
pmal-c vectors (Amersham-Pharmacia, Piscataway, New Jersey),
pFLAG vectors (Chiang and Roeder, 1993, Pept. Res. 6:62-64),
baculovirus vectors (Invitrogen, Carlsbad, CA; Pharmingen,
San Diego, CA), etc. The insertion into a cloning vector
can, for example, be accomplished by ligating the DNA
fragment into a cloning vector which has complementary
cohesive termini, by blunt end ligation if no complementary
cohesive termini are available, or through nucleotide
linkers, using techniques standard in the art. See, e.g.,
Ausubel et al. (eds.), Current Protocols in Molecular
Biology (1992). Recombinant vectors comprising the nucleic
acid of interest may then be introduced into a host cell
compatible with the vector (e.g. E. coli, insect cells,
mammalian cells, etc.) via transformation, transfection,
infection, electroporation, etc. The nucleic acid may also
be placed in a shuttle vector which may be cloned and
propagated to large quantities in bacteria and then
introduced into a eukaryotic cell host for expression. The
vector systems of the present invention may provide
expression control sequences and may allow for the
expression of proteins in vitro.

In a preferred embodiment, the full-length HDLP (SEQ ID
NO:2) is subcloned from an Aquifex aeolicus chromosomal DNA
preparation into pGEX4T3 (Amersham-Pharmacia, Piscataway,
New Jersey). In order to construct a double mutant
comprising a Cys75Ser and Cys77Ser mutation (SEQ ID NO:4),
and to construct the HDLP active site mutant Tyr297Phe (SEQ
ID NO:5 and SEQ ID NO:6), PCR site-directed mutagenesis may
be employed with verification by DNA sequencing by methods
known to those skilled in the art (see, e.g., Example 1
below). The mutants of the present invention may be
subcloned into a suitable expression vector and introduced
into a host cell for protein production, as described above.
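
By way of illustration only, the mutation nomenclature used above (e.g. Cys75Ser, Tyr297Phe) may be applied to a protein sequence as in the following Python sketch, which can help check a sequenced construct against the intended mutant. It models only the bookkeeping, not PCR mutagenesis itself. The function name is hypothetical, and the test sequence is an artificial placeholder, not the HDLP sequence of SEQ ID NO:1.

```python
import re

# Standard three-letter to one-letter amino acid codes.
THREE_TO_ONE = {
    "Ala": "A", "Arg": "R", "Asn": "N", "Asp": "D", "Cys": "C",
    "Gln": "Q", "Glu": "E", "Gly": "G", "His": "H", "Ile": "I",
    "Leu": "L", "Lys": "K", "Met": "M", "Phe": "F", "Pro": "P",
    "Ser": "S", "Thr": "T", "Trp": "W", "Tyr": "Y", "Val": "V",
}

_MUTATION = re.compile(r"^([A-Z][a-z]{2})(\d+)([A-Z][a-z]{2})$")


def apply_point_mutations(sequence, mutations):
    """Apply labels such as 'Cys75Ser' (1-based positions) to a sequence.

    Raises ValueError if a label is malformed, out of range, or names a
    wild-type residue that is not present at the stated position.
    """
    residues = list(sequence)
    for label in mutations:
        match = _MUTATION.match(label)
        if match is None:
            raise ValueError(f"unrecognized mutation label: {label}")
        wild, mutant = match.group(1), match.group(3)
        if wild not in THREE_TO_ONE or mutant not in THREE_TO_ONE:
            raise ValueError(f"unknown amino acid in label: {label}")
        index = int(match.group(2)) - 1
        if not 0 <= index < len(residues):
            raise ValueError(f"position out of range in label: {label}")
        if residues[index] != THREE_TO_ONE[wild]:
            raise ValueError(f"wild-type residue mismatch for {label}")
        residues[index] = THREE_TO_ONE[mutant]
    return "".join(residues)


# Artificial placeholder with cysteines at positions 75 and 77.
toy_wild_type = "A" * 74 + "CGC" + "A" * 10
toy_double_mutant = apply_point_mutations(
    toy_wild_type, ["Cys75Ser", "Cys77Ser"])
```

The wild-type check is a deliberate design choice: it catches off-by-one numbering errors before a mutant sequence is compared with sequencing results.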

The HDLP nucleic acids of the present invention may be
subcloned into an expression vector to create an expression
construct such that the resultant HDLP molecule is
produced as a fusion protein comprising a tag for ease of
purification. As referred to herein, a "tag" is any
additional amino acids which are provided in a protein
either C-terminally, N-terminally or internally for the ease
of purification, for the improvement of production or for
any other purpose which may facilitate the goals of the
present invention (e.g. to achieve higher levels of
production and/or purification). Such tags include tags
known to those skilled in the art to be useful in
purification such as, but not limited to, His tag,
glutathione-S-transferase tag, FLAG tag, MBP (maltose
binding protein) tag, etc. In a preferred embodiment, the
wild-type and mutant HDLPs of the present invention are
tagged with glutathione-S-transferase (see Example 1 below).
In another preferred embodiment, HDAC1 is FLAG-tagged (see
Example 1 below). Such tagged proteins may also be
engineered to comprise a cleavage site, such as a thrombin,
enterokinase or factor X cleavage site, for ease of removal
of the tag before, during or after purification. Vector
systems which provide a tag and a cleavage site for removal
of the tag are particularly useful to make the expression
constructs of the present invention.

The tagged HDLPs and HDACs of the present invention may be
purified by immuno-affinity or conventional chromatography,
including but not limited to, chromatography employing the
following: glutathione-sepharose™ (Amersham-Pharmacia,
Piscataway, New Jersey) or an equivalent resin, nickel or
cobalt purification resins, anion exchange chromatography,
cation exchange chromatography, hydrophobic resins, gel
filtration, anti-FLAG epitope resin, reverse phase
chromatography, etc. After purification, the HDLP and
HDLP-inhibitor compound complex may be concentrated to
greater than 1 mg/ml for crystallization purposes. In a
preferred embodiment, HDLP and HDLP-inhibitor complexes are
concentrated to greater than 10 mg/ml for crystallization
and, in a particularly preferred embodiment, HDLP and
HDLP-inhibitor complexes are concentrated to greater than 20
mg/ml.

In order to determine whether the purified HDLPs of the
present invention demonstrate histone deacetylase activity,
the purified HDLPs and also any HDLP-related protein may be
assayed by any method known to those skilled in the art for
the determination of said activity. In a preferred
embodiment, the purified HDLPs of the present invention are
incubated in the presence of [³H]acetyl-labeled histone
substrate (Carmen et al., 1996, J. Biol. Chem.
271:15837-15844) in a buffer suitable for detection of
histone deacetylase activity (see Example 3 below); the
reaction is then stopped, and the released acetate is
extracted and measured, as described by Hendzel et al. (J.
Biol. Chem. 266:21936-21942 (1991); Example 3 below). In a
preferred embodiment, the HDLPs of the present invention are
incubated in the presence of ZnCl₂ in order to obtain
histone deacetylase activity therefrom (Example 3 below).

In another embodiment, the crystals of the present invention
comprise purified wild-type HDLP (SEQ ID NO:1) and are grown
at room temperature by the hanging-drop vapor-diffusion
method from a crystallization solution comprising one or
more precipitants selected from the group consisting of
isopropanol, polyethylene glycol, and tert-butanol (see
Example 2 below). The crystallization solution may further
comprise one or more salts, including salts selected from the
group consisting of NaCl and KCl, and one or more buffers,
including buffers selected from the group consisting of Tris
(tris(hydroxymethyl)aminomethane) and bis-tris propane-Cl
(1,3-bis[tris(hydroxymethyl)methylamino]propane) (see
Example 2 below). The pH of the crystallization solution is
preferably between pH 5 and 9, although other pH values are
also contemplated by the present invention (see Example 2
below).

Any crystallization technique known to those skilled in the
art may be employed to obtain the crystals of the present
invention, including, but not limited to, batch
crystallization, vapor diffusion (either by sitting drop or
hanging drop) and microdialysis. Seeding of the crystals
may in some instances be required to obtain X-ray quality
crystals. Standard micro- and/or macro-seeding of crystals
may therefore be used.



The crystals of the present invention may form in the space
group C2 with one molecule in the asymmetric unit and with
unit cell dimensions of a = 51.4 Å, b = 93.8 Å, c = 78.7 Å
and β = 96.9° (see Example 2 below). The crystals of the
present invention may also form in the space group P2₁2₁2₁
with two molecules in the asymmetric unit and with unit cell
dimensions of a = 53.4 Å, b = 94.4 Å, c = 156.3 Å (see
Example 2 below). However, the present invention
contemplates crystals which form in any space group
including, but not limited to, C2, P2₁, P2₁2₁2₁, P3₁21,
P4₃2₁2, and C222₁. The crystals diffract to a resolution
greater than 4 Å, preferably greater than 2.5 Å.
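
By way of illustration only, the unit cell volumes implied by the dimensions recited above may be computed as in the following Python sketch. The formula V = a·b·c·sin β holds for monoclinic cells (α = γ = 90°) and reduces to a·b·c for orthorhombic cells. The function names are hypothetical, and the figure of four symmetry-equivalent positions per cell for each of C2 and P2₁2₁2₁ is standard crystallographic knowledge rather than a statement from the specification.

```python
import math


def cell_volume(a, b, c, beta_deg=90.0):
    """Volume (Å^3) of a monoclinic or orthorhombic unit cell.

    Assumes alpha = gamma = 90 degrees; beta defaults to 90 degrees,
    which gives the orthorhombic case.
    """
    return a * b * c * math.sin(math.radians(beta_deg))


def volume_per_molecule(volume, symmetry_positions, molecules_per_asu):
    """Cell volume divided by the number of molecules in the cell."""
    return volume / (symmetry_positions * molecules_per_asu)


# Cell dimensions recited in the text above.
v_c2 = cell_volume(51.4, 93.8, 78.7, beta_deg=96.9)
v_p212121 = cell_volume(53.4, 94.4, 156.3)

# Assumption: four equivalent positions per cell in both space groups.
per_molecule_c2 = volume_per_molecule(v_c2, 4, 1)
per_molecule_p212121 = volume_per_molecule(v_p212121, 4, 2)
```

Under these assumptions both crystal forms give a volume per molecule between 90,000 and 100,000 Å³, which is a useful consistency check on the recited numbers of molecules in the asymmetric unit.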

To collect diffraction data from the crystals of the present
invention, the crystals may be flash-frozen in the
crystallization buffer employed for the growth of said
crystals, preferably with a higher precipitant
concentration (see, e.g., Example 2 below). For example,
but not by way of limitation, if the precipitant used was
28% PEG 1500, the crystals may be flash-frozen in the same
crystallization solution employed for said crystal growth
wherein the concentration of the precipitant is increased to
35% (see Example 2 below). If the precipitant is not a
sufficient cryoprotectant (i.e. a glass is not formed upon
flash-freezing), cryoprotectants (e.g. glycerol, low
molecular weight PEGs, alcohols, etc.) may be added to the
solution in order to achieve glass formation upon
flash-freezing, provided the cryoprotectant is compatible
with preserving the integrity of the crystals. The
flash-frozen crystals are maintained at a temperature of
less than -110°C and preferably less than -150°C during the
collection of the crystallographic data by X-ray
diffraction. The X-ray diffraction data may be processed
with DENZO and SCALEPACK (Otwinowski & Minor, 1997, Methods
Enzymol. 276:307-326), but any method known to those skilled
in the art may be used to process the X-ray diffraction
data.

In order to determine the atomic structure of HDLP according
to the present invention, multiple isomorphous replacement
(MIR) analysis, model building and refinement may be
performed. For MIR analysis, the crystals may be soaked in
heavy atoms to produce the heavy atom derivatives necessary
for MIR analysis. As used herein, heavy atom derivative or
derivatization refers to the method of producing a
chemically modified form of a protein or protein complex
crystal wherein said protein is specifically bound to a
heavy atom within the crystal. In practice a crystal is
soaked in a solution containing heavy metal atoms or salts,
or organometallic compounds, e.g., lead chloride, gold
cyanide, thimerosal, lead acetate, uranyl acetate, mercury
chloride, gold chloride, etc., which can diffuse through the
crystal and bind specifically to the protein. The
location(s) of the bound heavy metal atom(s) or salts can be
determined by X-ray diffraction analysis of the soaked
crystal. This information is used to generate MIR phase
information which is used to construct the three-dimensional
structure of the crystallized HDLPs and HDLP-related
proteins of the present invention. In a preferred
embodiment, the heavy atoms comprise thimerosal, KAu(CN)₂
and Pb(Me)₃OAc (see Example 2 below). The MIR phases may be
calculated by any program known to those skilled in the art
and preferably with the program MLPHARE (The CCP4 suite:
Programs for computational crystallography, 1994, Acta
Crystallogr. D 50:760-763) and may also use the anomalous
diffraction signal from the thimerosal derivative. In a
preferred embodiment, the MIR phases were calculated at
2.5 Å and have a mean figure of merit of 0.55 (see Figure 19
and Example 2 below). The phases may be improved where
necessary by solvent flattening by methods known to those
skilled in the art including, but not limited to, through
the use of the program DM (The CCP4 suite: Programs for
computational crystallography, 1994, Acta Crystallogr. D
50:760-763).



Thereafter, an initial model of the three-dimensional
structure may be built using the program O (Jones et al.,
1991, Acta Crystallogr. A 47:110-119). The interpretation
and building of the structure may be further facilitated by
use of the program CNS (Brunger et al., 1998, Acta
Crystallogr. D 54:905-921).

For the determination of the HDLP-inhibitor compound complex
structure, if the space group of the HDLP-inhibitor compound
complex crystal is different, molecular replacement may be
employed using a known structure of apo-HDLP (as referred
to herein, apo-HDLP or apo-HDAC is the enzyme which is not
complexed with an inhibitor compound) or any known
HDLP/inhibitor complex structure whose structure may be
determined as described above and below in Example 2. If
the space group of the HDLP-inhibitor compound crystals is
the same, then rigid body refinement and difference Fourier
analysis may be employed to solve the structure using a
known structure of apo-HDLP or any known HDLP/inhibitor
complex structure.



The term "molecular replacement" refers to a method that
involves generating a preliminary model of the
three-dimensional structure of the HDLP crystals of the
present invention whose structure coordinates are unknown
prior to the employment of molecular replacement. Molecular
replacement is achieved by orienting and positioning a
molecule whose structure coordinates are known (in this case
the previously determined apo-HDLP) within the unit cell as
defined by the X-ray diffraction pattern obtained from an
HDLP or HDLP-related protein crystal whose structure is
unknown so as to best account for the observed diffraction
pattern of the unknown crystal. Phases can then be
calculated from this model and combined with the observed
amplitudes to give an approximate Fourier synthesis of the
structure whose coordinates are unknown. This in turn can
be subject to any of several forms of refinement to provide
a final, accurate structure.
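
By way of non-limiting illustration, the step of combining phases
calculated from a positioned model with observed amplitudes can be
sketched on a one-dimensional toy "crystal". The densities below
are hypothetical numbers chosen only to make the example run, the
function names are introduced here for illustration, and the
sketch is not the procedure carried out by any program cited in
this application.

```python
import cmath
import math


def structure_factors(density):
    """Discrete Fourier transform of a 1-D density sampled on a grid."""
    n = len(density)
    return [
        sum(rho * cmath.exp(2j * math.pi * h * x / n)
            for x, rho in enumerate(density))
        for h in range(n)
    ]


def fourier_synthesis(amplitudes, phases):
    """Inverse transform of amplitude/phase pairs back to a density map."""
    n = len(amplitudes)
    return [
        sum(a * cmath.exp(1j * p) * cmath.exp(-2j * math.pi * h * x / n)
            for h, (a, p) in enumerate(zip(amplitudes, phases))).real / n
        for x in range(n)
    ]


def phased_map(observed_amplitudes, model_density):
    """Combine observed amplitudes with phases calculated from a model."""
    model_phases = [cmath.phase(f) for f in structure_factors(model_density)]
    return fourier_synthesis(observed_amplitudes, model_phases)


# Hypothetical "unknown" structure and an already-positioned search model.
true_density = [0, 0, 5, 1, 0, 0, 3, 0]
model_density = [0, 0, 4, 0, 0, 0, 2, 0]

# Only amplitudes are measured in a diffraction experiment.
observed = [abs(f) for f in structure_factors(true_density)]
approx_map = phased_map(observed, model_density)
print([round(v, 2) for v in approx_map])
```

When the model is identical to the unknown structure the synthesis
reproduces that structure exactly; with an imperfect model the map
is only approximate, which is why refinement follows.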

Any method known to the skilled artisan may be employed to
determine the structure by molecular replacement. For
example, the program AMoRe (The CCP4 suite: Programs for
computational crystallography, 1994, Acta Crystallogr. D
50:760-763) may be employed to determine the structure of an
unknown histone deacetylase +/- an inhibitor by molecular
replacement using the apo-HDLP coordinates (Figure 16). For
the structure determination of the inhibitory compound TSA,
the structure of TSA obtained from the Cambridge Structural
Database (Refcode TRCHST; http://www.ccdc.cam.ac.uk) may be
employed to define the stereochemical restraints used in the
refinement with the program CNS (Brunger et al., 1998, Acta
Crystallogr. D 54:905-921).

The three-dimensional structural information and the atomic
coordinates associated with said structural information of
HDLP are useful for solving the structure of crystallized
proteins which belong to the HDAC family by molecular
replacement. Similarly, any structure of a crystallized
protein which is thought to be similar in structure based on
function or sequence similarity or identity to HDLP may be
solved by molecular replacement with the HDLP structural
information of the present invention. The structure of
HDLP-related proteins as determined by molecular replacement
as described above and in Example 2 below has a root mean
square deviation (rmsd) of no greater than 2.0 Å in the
positions of Cα atoms for at least 50% or more of the amino
acids of the structure over the 375 residues of full-length
HDLP. Such an rmsd may be expected based on the amino acid
sequence identity. Chothia & Lesk, 1986, EMBO J. 5:823-826.
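
By way of non-limiting illustration, one simple reading of the
criterion above (an rmsd of no greater than 2.0 Å in Cα positions
over at least 50% of the residues) can be sketched as follows. The
sketch assumes the two structures have already been superimposed
and that their Cα atoms are paired residue by residue; the
coordinates and function names are hypothetical and introduced
only for this example.

```python
import math


def per_residue_deviation(coords_a, coords_b):
    """Distance in angstroms between each pair of matched C-alpha atoms."""
    if len(coords_a) != len(coords_b) or not coords_a:
        raise ValueError("need two non-empty, equally long coordinate lists")
    return [math.dist(p, q) for p, q in zip(coords_a, coords_b)]


def rmsd(deviations):
    """Root mean square of a list of per-residue deviations."""
    return math.sqrt(sum(d * d for d in deviations) / len(deviations))


def rmsd_best_fraction(coords_a, coords_b, fraction=0.5):
    """rmsd over the best-fitting `fraction` of the matched residues."""
    deviations = sorted(per_residue_deviation(coords_a, coords_b))
    keep = max(1, math.ceil(fraction * len(deviations)))
    return rmsd(deviations[:keep])


# Hypothetical, pre-superimposed C-alpha traces of four residues.
reference = [(0.0, 0.0, 0.0), (3.8, 0.0, 0.0), (7.6, 0.0, 0.0), (11.4, 0.0, 0.0)]
candidate = [(0.0, 0.0, 1.0), (3.8, 0.0, 1.0), (7.6, 0.0, 5.0), (11.4, 0.0, 5.0)]

overall = rmsd(per_residue_deviation(reference, candidate))
best_half = rmsd_best_fraction(reference, candidate, fraction=0.5)
meets_criterion = best_half <= 2.0
print(overall, best_half, meets_criterion)
```

In this toy case the overall rmsd exceeds 2.0 Å, but the best half
of the residues deviates by only 1.0 Å, so the criterion as read
here is met.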



The refined three-dimensional HDLP structures of the present
invention, specifically apo-HDLP, Cys75Ser/Cys77Ser double
mutant HDLP comprising a zinc atom in the active site,
HDLP/TSA complex comprising a zinc atom in the active site,
and HDLP/SAHA complex comprising a zinc atom in the active
site, are represented by the atomic coordinates set forth in
Figures 16 to 19, respectively. The refined model for
apo-HDLP comprising amino acids 1-375 consists of wild-type
HDLP residues 2 to 373 with residues 1, 374 and 375 not
modeled and presumed disordered, and was determined to a
resolution of 1.8 Å. Similarly, the refined model for
Cys75Ser/Cys77Ser double mutant HDLP comprising a zinc atom
in the active site also consists of residues 2 to 373 with
residues 1, 374 and 375 not modeled and presumed disordered,
and was determined to a resolution of 2.0 Å. The refined
model for the HDLP/TSA complex comprising a zinc atom in the
active site consists of the Cys75Ser/Cys77Ser double mutant
HDLP residues 2 to 373 with residues 1, 374 and 375 not
modeled and presumed disordered, has TSA in the binding
pocket and was determined to a resolution of 2.1 Å. The
HDLP/SAHA complex is similar to the HDLP/TSA complex but has
SAHA in the binding pocket and was determined to a
resolution of 2.5 Å.

For the purposes of further describing the structure of HDLP
and HDLP-related proteins, including, but not limited to,
HDACs, from the data obtained from the HDLP crystals of the
present invention, the definition of the following terms is
provided:



The term "β sheet" refers to two or more polypeptide chains
(or β strands) that run alongside each other and are linked
in a regular manner by hydrogen bonds between the main chain
C=O and N-H groups. Therefore all hydrogen bonds in a
β sheet are between different segments of polypeptide.
Most β sheets in proteins are all-parallel (protein
interiors) or all-antiparallel (one side facing solvent, the
other facing the hydrophobic core). Hydrogen bonds in
antiparallel sheets are perpendicular to the chain direction
and spaced evenly as pairs between strands. Hydrogen bonds
in parallel sheets are slanted with respect to the chain
direction and spaced evenly between strands.

The term "α helix" refers to the most abundant helical
conformation found in globular proteins. The average length
of an α helix is 10 residues. In an α helix, all amide
protons point toward the N-terminus and all carbonyl oxygens
point toward the C-terminus. The repeating nature of the
phi, psi pairs ensures this orientation. Hydrogen bonds
within an α helix also display a repeating pattern in which
the backbone C=O of residue X (wherein X refers to any amino
acid) hydrogen bonds to the backbone HN of residue X+4.
The α helix is a coiled structure characterized by 3.6
residues per turn, and translating along its axis 1.5 Å per
amino acid. Thus the pitch is 3.6 × 1.5 or 5.4 Å. The screw
sense of α helices is always right-handed.

The term "loop" refers to any other conformation of amino
acids (i.e. not a helix, strand or sheet). Additionally, a
loop may contain bond interactions between amino acid side
chains, but not in a repetitive, regular fashion.
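
By way of non-limiting illustration, the α helix parameters given
above (3.6 residues per turn and a rise of 1.5 Å per residue) can
be turned into an idealized right-handed Cα trace. The Cα radius
of 2.3 Å used below is an assumption made only for this sketch and
is not taken from this application; the function names are
likewise introduced here for illustration.

```python
import math

RESIDUES_PER_TURN = 3.6   # from the definition above
RISE_PER_RESIDUE = 1.5    # angstroms, from the definition above


def helix_pitch():
    """Axial advance per full turn, in angstroms."""
    return RESIDUES_PER_TURN * RISE_PER_RESIDUE


def helix_ca_trace(n_residues, radius=2.3):
    """Idealized right-handed C-alpha positions along the z axis.

    `radius` (angstroms) is an assumed value for illustration only.
    """
    step = 2.0 * math.pi / RESIDUES_PER_TURN  # 100 degrees per residue
    return [
        (radius * math.cos(i * step),
         radius * math.sin(i * step),
         i * RISE_PER_RESIDUE)
        for i in range(n_residues)
    ]


trace = helix_ca_trace(10)  # a helix of average length (10 residues)
print(f"pitch = {helix_pitch():.1f} angstroms")
print(f"axial span of 10 residues = {trace[-1][2]:.1f} angstroms")
```

Counter-clockwise rotation in the x-y plane combined with
increasing z gives the right-handed screw sense stated in the
definition.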

Amino acid residues in peptides shall hereinafter be
abbreviated as follows: Phenylalanine is Phe or F; Leucine
is Leu or L; Isoleucine is Ile or I; Methionine is Met or M;
Valine is Val or V; Serine is Ser or S; Proline is Pro or P;
Threonine is Thr or T; Alanine is Ala or A; Tyrosine is Tyr
or Y; Histidine is His or H; Glutamine is Gln or Q;
Asparagine is Asn or N; Lysine is Lys or K; Aspartic Acid is
Asp or D; Glutamic Acid is Glu or E; Cysteine is Cys or C;
Tryptophan is Trp or W; Arginine is Arg or R; and Glycine is
Gly or G. For further description of amino acids, please
refer to Proteins: Structure and Molecular Properties by
Creighton, T.E., W.H. Freeman & Co., New York 1983.
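
By way of non-limiting illustration, the abbreviations listed
above can be held in a lookup table and used to convert residue
names and mutation labels such as Cys75Ser into their one-letter
form. The helper names below are introduced only for this sketch.

```python
import re

# Three-letter to one-letter codes, as listed in the paragraph above.
THREE_TO_ONE = {
    "Phe": "F", "Leu": "L", "Ile": "I", "Met": "M", "Val": "V",
    "Ser": "S", "Pro": "P", "Thr": "T", "Ala": "A", "Tyr": "Y",
    "His": "H", "Gln": "Q", "Asn": "N", "Lys": "K", "Asp": "D",
    "Glu": "E", "Cys": "C", "Trp": "W", "Arg": "R", "Gly": "G",
}

_MUTATION = re.compile(r"^([A-Za-z]{3})(\d+)([A-Za-z]{3})$")


def to_one_letter(residues):
    """Convert a sequence of three-letter names to a one-letter string."""
    return "".join(THREE_TO_ONE[name.capitalize()] for name in residues)


def short_mutation(label):
    """Convert a label such as 'Cys75Ser' to its one-letter form 'C75S'."""
    match = _MUTATION.match(label)
    if match is None:
        raise ValueError(f"not a mutation label: {label!r}")
    wild_type, position, mutant = match.groups()
    return (THREE_TO_ONE[wild_type.capitalize()]
            + position
            + THREE_TO_ONE[mutant.capitalize()])


print(short_mutation("Cys75Ser"), short_mutation("Cys77Ser"))
print(to_one_letter(["His", "Asp"]))
```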

The term "positively charged amino acid" refers to any amino
acid having a positively charged side chain under normal
physiological conditions. Examples of positively charged
amino acids are Arg, Lys and His. The term "negatively
charged amino acid" refers to any amino acid having a
negatively charged side chain under normal physiological
conditions. Examples of negatively charged amino acids are
Asp and Glu. The term "hydrophobic amino acid" refers to
any amino acid having an uncharged, nonpolar side chain that
is relatively insoluble in water. Examples of hydrophobic
amino acids are Ala, Leu, Ile, Gly, Val, Pro, Phe, Trp and
Met. The term "hydrophilic amino acid" refers to any amino
acid having an uncharged, polar side chain that is
relatively soluble in water. Examples of hydrophilic amino
acids are Ser, Thr, Tyr, Asn, Gln, and Cys. The term
"aromatic amino acid" refers to any amino acid comprising a
ring structure. Examples of aromatic amino acids are His,
Phe, Trp and Tyr.

The term "charge relay system" refers to a His-Asp
arrangement as described by Fersht & Sperling, 1973, J. Mol.
Biol. 74:137-149; Blow et al., 1969, Nature 221:337-340.

The information obtained from the three-dimensional
structures of the present invention reveals that HDLP has a
single-domain structure that belongs to the open α/β class
of folds (see, e.g., Branden, 1980, Q. Rev. Biophys.
13:317-38). Two orthogonal views of the overall
three-dimensional structure of HDLP are depicted in Figure
4A and 4B. The HDLP structure has a central eight-stranded
parallel β sheet (strands arranged as
β2-β1-β3-β8-β7-β4-β5-β6), and sixteen α helices (labeled α1
through α16 respectively). See Figure 4C. Four helices pack
on each face of the β sheet (α7, α8, α9, α10 and α11, α12,
α13, α14), forming the core α/β structure characteristic of
this class of folds. Most of the remaining eight helices
are positioned near one side of the β sheet, near strands
β2-β1-β3-β8. Large, well defined loops (Loops L1-L7; Figure
4C) originate from the C-terminal ends of the β strands.
The extra helices and the large L1-L7 loops are associated
with a significant extension of the structure beyond the
core α/β motif. This extension of the structure gives rise
to two prominent architectural features: a deep, narrow
pocket and an internal cavity adjacent to the pocket. These
two architectural features comprise the active site (see
Figure 5A). The structure of HDLP-related proteins (e.g.
HDACs) may also comprise the conserved α/β structure
characteristic.

The term "active site" comprises any or all of the following
sites in HDLP: the substrate binding site, the site where
the cleavage of an acetyl group from a substrate occurs or
the site where an inhibitor of the HDAC family or, more
particularly, HDLP binds. The active site, as referred to
herein, comprises Asp166, Asp258, His170, Tyr297, His131,
His132, Asp168, Asp173, Phe141, Phe198, Leu265, Pro22 and
Gly140, and also a metal bound at the bottom of the pocket
by Asp173, Asp168 and His defined by the coordinates listed
in Figures 16 to 19 with an rmsd of 2.0 Å. The metal which
binds at the bottom of the pocket will be a divalent cation
selected from the group consisting of zinc, cobalt and
manganese.

The deep narrow pocket has a tube-like shape with a depth of
~11 Å. The pocket opening constricts half way down to
~4.5 by 5.5 Å, and becomes wider at the bottom (see Figure
5A). The pocket and its immediate surroundings are made up
of loops L1 through L7.

The walls of the pocket are covered with side chains of
hydrophobic and aromatic residues (Pro22, Tyr91 near the
entrance; and Gly140, Phe141, Phe198, Leu265 and Tyr297
further down; Figure 5B). For numbering of amino acids
please refer to SEQ ID NO:1. Of particular interest are
Phe141 and Phe198, whose phenyl groups face each other in
parallel at a distance of 7.5 Å, marking the most slender
portion of the pocket (see Figure 5B). Also of particular
interest is that only one pocket residue differs in HDAC1
when the sequences are aligned (alignment may be
accomplished using the DNAstar™ MegAlign™ program, Madison,
WI): Tyr91 of HDLP corresponds to Glu98 of HDAC1. The
structure reveals that this residue in HDLP is mostly
solvent exposed.

Near the bottom of the pocket of the active site, at its
narrowest point, is located a zinc ion (see Figure 6A). In
order to obtain the zinc in the structure, the crystals may
be soaked in zinc (e.g. ZnCl₂) or co-crystallized in the
presence of zinc. The zinc ion is coordinated by Asp168
(Oδ1, 2.1 Å), His170 (Nδ1, 2.1 Å), Asp258 (Oδ1, 1.9 Å) and
a water molecule (2.5 Å). See Figure 5B and 6B. The amino
acid residues that coordinate zinc are arranged in a
tetrahedral geometry, but the position of the water
molecule, which is also hydrogen bonded to His131, deviates
from this geometry by ~25°.
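
By way of non-limiting illustration, a deviation from tetrahedral
geometry of the kind described above can be quantified by
comparing each ligand-zinc-ligand angle with the ideal tetrahedral
angle of about 109.5°. The coordinates below are hypothetical:
they place the four ligands along ideal tetrahedral directions at
the distances quoted above, and are not taken from Figures 16 to
19. The function names are likewise introduced only for this
sketch.

```python
import math

IDEAL_TETRAHEDRAL_ANGLE = math.degrees(math.acos(-1.0 / 3.0))  # ~109.47


def angle_at(center, p, q):
    """Angle p-center-q in degrees."""
    u = [a - c for a, c in zip(p, center)]
    v = [b - c for b, c in zip(q, center)]
    dot = sum(a * b for a, b in zip(u, v))
    norm = math.hypot(*u) * math.hypot(*v)
    cosine = max(-1.0, min(1.0, dot / norm))  # guard against rounding
    return math.degrees(math.acos(cosine))


def tetrahedral_deviations(center, ligands):
    """Deviation of every ligand-center-ligand angle from the ideal value."""
    names = sorted(ligands)
    return {
        (a, b): angle_at(center, ligands[a], ligands[b]) - IDEAL_TETRAHEDRAL_ANGLE
        for i, a in enumerate(names)
        for b in names[i + 1:]
    }


def scaled(direction, length):
    """Point at `length` angstroms from the origin along `direction`."""
    norm = math.hypot(*direction)
    return tuple(length * x / norm for x in direction)


zinc = (0.0, 0.0, 0.0)
# Hypothetical positions: ideal directions, distances from the text.
ligands = {
    "Asp168 OD1": scaled((1, 1, 1), 2.1),
    "His170 ND1": scaled((1, -1, -1), 2.1),
    "Asp258 OD1": scaled((-1, 1, -1), 1.9),
    "water": scaled((-1, -1, 1), 2.5),
}
deviations = tetrahedral_deviations(zinc, ligands)
print(max(abs(d) for d in deviations.values()))
```

With these idealized positions every deviation is essentially
zero; substituting refined coordinates for the zinc and its
ligands would report the actual departure of each angle from the
ideal value.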

In addition to the zinc ligands, the bottom of the pocket
contains two histidines (His131 and His132), two aspartic
acids (Asp166 and Asp173) and a tyrosine (Tyr297). See
Figure 5B and 10B. Each of the histidines makes a hydrogen
bond through its Nδ1 to an aspartic acid carboxylate oxygen,
with the oxygen located in the plane of the imidazole ring
(Figure 5B). This His-Asp arrangement is characteristic of
the charge relay system present in the active sites of
serine proteases, where it serves to polarize the imidazole
Nε and increase its basicity. Fersht & Sperling, 1973, J.
Mol. Biol. 74:137-149; Blow et al., 1969, Nature
221:337-340.

The Asp166-His131 charge relay pair (hereafter referred to
as "buried charge relay") is positioned even deeper in the
pocket and is more buried compared to the Asp173-His132
charge relay (hereafter referred to as "exposed charge
relay"), which is partially solvent exposed. The buried
charge relay makes a hydrogen bond (2.6 Å) to the zinc-bound
water molecule referred to above, and this hydrogen bond
could contribute to the deviation of the water-zinc
coordination from ideal geometry (Figure 5B). The exposed
charge relay is directed to a point ~2.5 Å away from the
water molecule, and closer to the surface.

Tyr297 is positioned next to the zinc, opposite from where
the two charge relay systems are located. The Tyr hydroxyl
group lies 4.4 Å away from the zinc atom and has no
interactions with the rest of the protein (Figure 5B). Next
to Tyr297, there is an opening in the pocket wall, which
leads to the adjacent internal cavity.

The floor of the internal cavity is made up of portions of
the L3 and L7 loops as they emerge from the β strands, and
the roof is made up by the α1-L1-α2 segment. The L1 loop
appears more flexible than other loops in the structure.
This may allow the transient exchange of the cavity contents
with the bulk solvent.

The cavity is lined primarily with hydrophobic residues and
is particularly rich in glycine residues (Ala127, Gly128,
Gly129, Met130, and Phe141 of L3; Gly293, Gly294, Gly295 and
Gly296 of L7; and Tyr17, Pro22 and Leu23 of L1). There are
only two charged residues in the cavity (Arg27 and His21)
and these are contributed by the L1 loop.

The cavity may provide space for the diffusion of the 
acetate product away from the catalytic center, which may 
otherwise be crowded and shielded during deacetylation from 
the solvent when the substrate is bound. Such a role for 
the cavity is supported by the observation that the cavity 
contains three water and two isopropanol molecules (from the 
crystallization buffer) in the 1.8 Å apo-protein structure.
The cavity may also bind another cofactor, in addition to
zinc, for the facilitation of the enzymatic activity of the
HDLP. A proposed catalytic mechanism for deacetylation is
provided in Figure 8.

The structure of HDLP as defined by the present invention,
in conjunction with the HDAC1 sequence homology, shows that
the 375-amino acid HDLP protein corresponds to the histone
deacetylase catalytic core which is conserved across the
HDAC family (see Figure 2). The 35.2% HDLP-HDAC1 sequence
identity predicts structural similarity with an rmsd in Cα
positions of ~1.5 Å. Chothia and Lesk describe the
relation between the divergence of sequence and the
structure of proteins in EMBO J. 5:823-826 (1986). The
40-residue C-terminus of HDLP is likely to have a divergent
structure since this region has lower homology to HDAC1,
although the α16 helix in this region is part of the
conserved open α/β core fold and HDAC1 is likely to comprise
a similar helix. However divergent this C-terminal region
may be, this region is outside the active site and is
unlikely to affect the structure of the active site. Beyond
the C-terminus of the histone deacetylase catalytic core,
HDAC family members are divergent in length and sequence.
In the HDAC family, this region (amino acid residues
~390-482) is highly polar, populated with acidic residues,
and is likely to be flexible or loosely folded.

The HDLP-HDAC homology maps primarily to the hydrophobic
core and to the L1-L7 loops, with portions of the loops that
make up the pocket and adjacent cavity having the highest
level of amino acid residue sequence conservation (Figure 9A
and 9B). Specifically, all of the polar residues in the
active site (the zinc ligands, the two charge relay systems,
and Tyr297) and the hydrophobic residues that make up the
walls of the pocket (Gly140, Phe141, Phe198 and Leu265) are
identical. Among the residues that make up the internal
cavity, the ones closest to the active site are either
identical or conservatively substituted (for example,
Leu23→Met and Met130→Leu). Surface residues around the
pocket are conserved to a lesser extent, but are still above
35% average sequence identity.

The information obtained from the inhibitor-bound HDLP
complex crystal structures of the present invention reveals
detailed information which is useful in the design,
isolation, screening and determination of potential
inhibitor compounds which may inhibit HDLP/HDAC family
members. As described above, the HDLP structure consists of
a parallel β sheet with α helices packing against both faces
(Figure 4A, 4B, and 4C). At one end of the β sheet, seven
loops (L1-L7) form a narrow, tube-like pocket which is lined
with hydrophobic residues and which comprises a zinc binding
site and several polar side chains, including two Asp-His
charge relay systems. Mutation of the zinc ligands and
other polar residues at the pocket bottom reduces or
eliminates the catalytic activity.

The present inventors found that the Tyr297Phe mutation
reduced activity. See also, Hassig et al., 1998, Proc.
Natl. Acad. Sci. USA 95:3519-3524; Kadosh & Struhl, 1998,
Genes Dev. 12:797-805. The elimination of activity by
mutation of these residues indicates that this region is the
enzyme active site. Adjacent to the active site, there is
an internal cavity that may provide space for the diffusion
of the acetate reaction product. Homology at the active
site between HDLP and HDAC1, as described above, indicates
that they share structural and functional homology.

The inhibitor compound, trichostatin A (TSA) (Tsuji et al.,
1976, J. Antibiotics 29:1-6) binds HDLP by inserting its
long aliphatic chain, which has a hydroxamic acid group at
one end, into the pocket (Figure 6A, 6B and 6C). The
aliphatic chain makes multiple contacts in the well-like,
hydrophobic portion of the pocket. The hydroxamic acid
reaches the polar bottom of the pocket, where it coordinates
the zinc in a bidentate fashion and also forms hydrogen
bonds with the polar residues in the active site, including
the two charge relay system histidines. The aromatic
dimethylamino-phenyl group at the other end of the TSA chain
makes contacts at the pocket entrance and serves to cap it.
The amino acid residues of HDLP which contact TSA are
conserved in HDAC, indicating that TSA binds and inhibits
HDAC in a similar fashion to HDLP.

In the complex, the hydroxamic acid, most of the aliphatic
chain and part of the dimethylamino-phenyl group of TSA are
buried (60% of TSA's surface area; Figure 6A). The
hydroxamic acid group binds the zinc in a bidentate fashion
forming bonds through its carbonyl (2.4 Å) and hydroxyl
groups (2.2 Å) resulting in a penta-coordinated Zn²⁺ (Figure
6B and 6C). The hydroxamic acid hydroxyl group replaces the
water molecule that binds to the zinc in the apo-HDLP
structure described above. The hydroxamic acid also
hydrogen bonds with both charge relay system histidines
(hydroxyl oxygen to His131 Nε2, 2.8 Å; and nitrogen to
His132 Nε2, 2.8 Å), and the Tyr297 hydroxyl group (2.4 Å;
Figure 6B and 6C).

The 5-carbon long branched alkene chain of TSA fits snugly
in the narrow portion of the pocket making multiple van der
Waals contacts with all of the hydrophobic groups lining the
pocket (Figure 6B and 6C). Near its center, the chain
contains a methyl substituted carbon-carbon double bond
which is sandwiched between the phenyl groups of Phe141
and Phe198 at the tightest point of the pocket (Figure 6A
and 6B). The length of the alkene chain appears optimal for
spanning the length of the pocket, and allowing contacts
both at the bottom and at the entrance of the pocket,
although the cap group of Formula (I) may provide length to
span the pocket, allowing for a shorter alkene chain
(aliphatic chain).

At the entrance of the pocket, one face of the planar
structure formed by the dimethylamino-phenyl and adjacent
carbonyl groups of TSA makes contacts at the rim of the
pocket (Pro22, Tyr91, Phe141; Figure 6B and 6C). This
packing is facilitated by the roughly 110° angle in the
overall structure of TSA at the junction of the aliphatic
chain and the dimethylamino-phenyl group (occurring at the
sp³ hybridized C8 carbon). Upon TSA binding, the side chain
of Tyr91, which is mostly solvent exposed, changes
conformation to make space for the dimethylamino-phenyl
group. This is the only change near the active site
observed upon TSA binding.

The hydroxamic acid group is a common motif in zinc
metalloprotease inhibitors. See U.S. Patent Nos. 5,919,940
and 5,917,090; see also, Grams et al., 1995, Biochemistry
34:14012-14020; Lovejoy et al., 1999, Nat. Struct. Biol.
6:217-221; and Holmes & Matthews, 1981, Biochemistry
20:6912-6920. Like TSA, these inhibitors also coordinate
the active site zinc in a bidentate fashion using their
hydroxamate hydroxyl and carbonyl oxygens, replace the
nucleophilic water molecule with their hydroxamate hydroxyl
groups and form hydrogen bonds to the general base (Grams et
al., 1995, Biochemistry 34:14012-14020; Lovejoy et al.,
1999, Nat. Struct. Biol. 6:217-221; and Holmes & Matthews,
1981, Biochemistry 20:6912-6920).

SAHA, which has a ~30-fold weaker inhibitory activity than
TSA (Richon et al., 1998, Proc. Natl. Acad. Sci. USA
95:3003-3007), binds HDLP similarly to TSA (see, e.g.,
Figure 4D). The SAHA hydroxamic acid group makes the same
contacts to the zinc and active site residues, and the
importance of these interactions is underscored by the loss
of activity of SAHA derivatives lacking the hydroxamic group
(Richon et al., 1998, Proc. Natl. Acad. Sci. USA
95:3003-3007). The six-carbon long aliphatic chain of SAHA
packs in the tube-like hydrophobic portion of the pocket.
Compared to TSA, however, SAHA's aliphatic chain packs less
snugly and makes fewer van der Waals contacts, in part
because SAHA lacks TSA's C15 methyl group branch. SAHA also
lacks TSA's double bonds in this region, and this may lead
to increased flexibility of the aliphatic chain. The cap
group of SAHA consists of a phenyl-amino ketone group. In
the crystal structure, the phenyl group has weak electron
density, suggesting that it does not pack as well as the cap
group of TSA. This may be due to the larger separation
between the hydroxamic and cap groups of SAHA compared to
TSA (compare TSA, Formula (II) and SAHA, Formula (III),
below).

(II)   [structure of TSA; drawing not reproduced]

(III)  [structure of SAHA; drawing not reproduced]

The determination of the structure of HDLP and HDLP bound to
an inhibitory compound has enabled, for the first time, the
identification of the active site of HDLP and of related
HDLP proteins, such as proteins belonging to the HDAC
family.

The three-dimensional structural information and the atomic
coordinates associated with said structural information of
HDLP bound to an inhibitory compound are useful in rational
drug design, providing for a method of identifying inhibitory
compounds which bind to and inhibit the enzymatic activity
of HDLP, HDAC family proteins and other histone
deacetylase-like proteins related to HDLP. Said method for
identifying said potential inhibitor for an enzyme
comprising deacetylase activity comprises the steps of (a)
using a three-dimensional structure of HDLP as defined by
its atomic coordinates listed in Figures 16 to 19; (b)
employing said three-dimensional structure to design or
select said potential inhibitor; (c) synthesizing said
potential inhibitor; (d) contacting said potential inhibitor
with said enzyme in the presence of an acetylated substrate;
and (e) determining the ability of said inhibitor to inhibit
said deacetylase activity.

The potential HDLP and HDLP-related (e.g. HDAC) inhibitors
identified by the method of the present invention are
represented by formula (I)

(I)  [structure drawing not reproduced]

wherein X comprises a cap group which binds to at least one
amino acid selected from the group consisting of proline and
leucine; Y comprises an aliphatic chain group which binds to
at least one amino acid selected from the group consisting
of leucine, phenylalanine and glycine; and Z comprises an
active site binding group which binds to at least one amino
acid selected from the group consisting of aspartic acid,
tyrosine and histidine and wherein Z may further bind to a
zinc atom, and with the proviso that the compound of
Formula (I) is not TSA, trapoxin, SAHA, or the SAHA
derivatives described in U.S. Patent Nos. 5,608,108;
5,700,811; 5,773,474; 5,840,960 and 5,668,179.

The present invention permits the use of molecular design
techniques to design, identify and synthesize chemical
entities and compounds, including inhibitory compounds,
capable of binding to the active site of HDLP and
HDLP-related proteins. The atomic coordinates of apo-HDLP
and inhibitor-bound HDLP may be used in conjunction with
computer modeling using a docking program such as GRAM,
DOCK, HOOK or AUTODOCK (Dunbrack et al., 1997, Folding &
Design 2:27-42) to identify potential inhibitors of HDLP and
HDLP-related proteins (e.g. HDAC1). This procedure can
include computer fitting of potential inhibitors to the
active site of HDLP to ascertain how well the shape and the
chemical structure of the potential inhibitor will
complement the active site or to compare the potential
inhibitors with the binding of TSA or SAHA in the active
site. See Bugg et al., 1998, Scientific American
December:92-98; West et al., 1995, TIPS 16:67-74. The
potential inhibitors designed by modeling with a docking
program conform to the general formula (I) as described
above. Computer programs may also be employed to estimate
the attraction, repulsion and steric hindrance of the HDLP
and potential inhibitor compound. Generally, the tighter
the fit and the lower the steric hindrance, the greater the
attractive forces and the greater the specificity. These
are important features for a specific inhibitory compound,
which is more likely to interact with HDLP and HDLP-related
proteins rather than other classes of proteins. These
features are desired particularly where the inhibitory
compound is a potential antitumor drug.

The compounds of the present invention may also be designed 
by visually inspecting the three-dimensional structure to 
determine more effective deacetylase inhibitors. This type 
of modeling may be referred to as "manual" drug design. 
Manual drug design may employ visual inspection and analysis 
using a graphics visualization program such as "O" (Jones, 
T.A., Zou, J.Y., Cowan, S.W., and Kjeldgaard, M., 1991, Improved 
method for building protein models in electron density maps 
and the location of errors in these models, Acta 
Crystallogr. A47, 110-119). 

Initially, potential inhibitor compounds can be selected for 
their structural similarity to the X, Y and Z constituents 
of formula (I) by manual drug design. The structural analog 
thus designed can then be modified by computer modeling 
programs to better define the most likely effective 
candidates. Reduction of the number of potential candidates 
is useful as it may not be possible to synthesize and screen 
the countless variations of compounds that may have 
some similarity to known inhibitory molecules. Such 
analysis has been shown effective in the development of HIV 
protease inhibitors (Lam et al., 1994, Science 263:380-384; 
Wlodawer et al., 1993, Ann. Rev. Biochem. 62:543-585; 
Appelt, 1993, Perspectives in Drug Discovery and Design 
1:23-48; Erickson, 1993, Perspectives in Drug Discovery and 
Design 1:109-128). Alternatively, random screening of a 
small molecule library could lead to potential inhibitors 
whose inhibitory activity may then be analyzed by computer 
modeling as described above to better determine their 
effectiveness as inhibitors. 

The compounds designed using the information of the present 
invention may be competitive or noncompetitive inhibitors. 
These designed inhibitors may bind to all or a portion of 
the active site of HDLP and may be more potent, more 
specific, less toxic and more effective than known 
inhibitors for HDLP and HDLP-related proteins, and 
particularly HDACs. The designed inhibitors may also be 
less potent but have a longer half life in vivo and/or in 
vitro and therefore be more effective at inhibiting histone 
deacetylase activity in vivo and/or in vitro for prolonged 
periods of time. Said designed inhibitors are useful to 
inhibit the histone deacetylase activity of HDLP and 
HDLP-related proteins (e.g. HDAC1), to inhibit cell growth in 
vitro and in vivo and may be particularly useful as 
antitumor agents. 

The present invention also permits the use of molecular 
design techniques to computationally screen small molecule 
databases for chemical entities or compounds that can bind 
to HDLP in a manner analogous to TSA and SAHA as defined 
by the structure of the present invention. Such 
computational screening may identify various groups which 
may be defined as "X", "Y" or "Z" of formula (I) above and 
may be employed to synthesize the potential inhibitors of 
the present invention comprising formula (I). Such 
potential inhibitors may be assayed for histone deacetylase 
inhibitory activity in a histone deacetylase activity assay 
(see Example 3 below), may be co-crystallized with HDLP to 
determine the binding characteristics through X-ray 
crystallography techniques defined above (e.g. said 
co-crystal structure may be determined by molecular replacement 
to assess the binding characteristics of said potential 
inhibitor), or may be assessed based on binding activity by 
incubating said potential inhibitor with said HDLP, 
performing gel filtration to separate any free potential 
inhibitor from HDLP-bound inhibitor, and determining the 
amount of histone deacetylase activity of the 
inhibitor-bound HDLP. To measure binding constants (e.g., Kd), 
methods known to those in the art may be employed such as 
Biacore™ analysis, isothermal titration calorimetry, ELISA 
with a known drug on the plate to show competitive binding, 
or a deacetylase activity assay. 
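
For orientation, the relationship between a dissociation constant (Kd) and the fraction of enzyme occupied by inhibitor may be sketched as follows. The sketch assumes simple one-site reversible binding with the free inhibitor concentration approximately equal to the total inhibitor concentration; these assumptions and the function names are illustrative and are not taken from the methods named above.

```python
def fraction_bound(ligand_conc, kd):
    """Fraction of enzyme occupied at a given free ligand concentration.

    Assumes one-site reversible binding; ligand_conc and kd must share units.
    """
    if ligand_conc < 0 or kd <= 0:
        raise ValueError("ligand_conc must be >= 0 and kd must be > 0")
    return ligand_conc / (kd + ligand_conc)


def kd_from_fraction(ligand_conc, fraction):
    """Invert the one-site model: estimate Kd from a measured occupancy."""
    if not 0 < fraction < 1:
        raise ValueError("fraction must lie strictly between 0 and 1")
    return ligand_conc * (1.0 - fraction) / fraction
```

Under this model, half of the enzyme is occupied when the free inhibitor concentration equals Kd, so a lower Kd indicates tighter binding.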

The design of potential inhibitors of the present invention 
is further facilitated by reference to Figure 9, which is a 
surface representation that depicts the surface 
grooves. Analysis of such grooves gives insight into the 
constituents of the cap group of formula (I). The surface 
grooves, into which additional cap groups may bind, are 
labeled groove A, groove A', groove B and groove C. The structure 
of HDLP bound to either TSA or SAHA shows that the cap 
groups of TSA and SAHA bind in groove A. Analysis of the 
amino acid sequence identity of HDLP and HDACs shows that 
groove A is well conserved in HDACs. It also has a significant 
hydrophobic component, appears deep enough to allow for 
significant interactions and is the largest of the four grooves. 
In addition to the dimethylamino phenyl group of TSA, 
groove A can fit approximately 200 daltons worth of 
groups (e.g. groove A could accommodate a naphthalene-like 
group after an appropriate spacer). Groove A, as 
referred to herein, is characterized by the following 
conserved residues of HDLP: His21, Pro22, Lys24, Phe141, 
Leu265 and Phe335. The periphery of groove A comprises 
unconserved residues. Additionally, groove A', as referred 
to herein, comprises primarily unconserved residues. 
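
As a rough check of the approximately 200 dalton capacity stated above for groove A, the mass of a candidate group can be computed from its molecular formula. The following is a minimal sketch; the helper names and the use of a simple formula string are hypothetical, and the 200 dalton threshold is only the approximate figure given in the text.

```python
import re

# Standard average atomic masses (daltons) for the elements used here.
ATOMIC_MASS = {"H": 1.008, "C": 12.011, "N": 14.007, "O": 15.999, "S": 32.06}


def molecular_weight(formula):
    """Return the mass in daltons of a simple formula such as 'C10H8'.

    Only element symbols followed by optional integer counts are supported
    (no parentheses or charges).
    """
    total = 0.0
    for element, count in re.findall(r"([A-Z][a-z]?)(\d*)", formula):
        total += ATOMIC_MASS[element] * (int(count) if count else 1)
    return total


def fits_groove_a(formula, budget_da=200.0):
    """True if the group's mass is within the approximate groove A budget."""
    return molecular_weight(formula) <= budget_da
```

For example, naphthalene (C10H8) has a mass of about 128 daltons, which is consistent with the statement that a naphthalene-like group could be accommodated.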

Groove B is immediately adjacent to the pocket. Of 
significance is that the bottom of groove B comprises the 
N-epsilon nitrogen of His170, which coordinates the zinc 
through its N-delta nitrogen. Significant binding energy 
may be achieved by contacting the N-epsilon proton of His170 with 
a carboxylic acid or sulfate group. In addition, groove B 
may be large enough to fit a phenyl group, the face of which 
may comprise a partial negative charge which may pack over 
the N-epsilon proton of His170. The conserved residues of 
groove B, as referred to herein, are: His170, Tyr196 and 
Leu265. 



Groove C is not as well conserved as the other two grooves 
and the amino acid residues which comprise groove C are 
mostly polar and solvent exposed. Groove C, as referred to 
herein, comprises the following conserved residues: Asn87, 
Gly140 and Phe198. 

The compounds of the present invention are represented by 
formula (I): 
(I) [structural drawing of formula (I) not reproduced] 

Examples for suitable X constituents wherein X comprises a 
cap group may be described in three categories, depending 
upon which surface of groove A, A', B and/or C they are 
targeted to. The cap group may comprise all three 
categories on the same compound. Of particular benefit may 
be replacing the cap group of TSA or SAHA with a large, 
rigid structure. Nonlimiting examples for suitable cap 
groups (X) of formula (I) which may bind in groove A are: 
(1) attaching a 1-3 methyl linker followed by a phenyl or 
naphthalene group from the para or meta position of SAHA's 
phenyl group represented by formula (IV): 



30 
(IV) [structural drawing of formula (IV) not reproduced] 



(2) attaching a 2-3 methyl linker followed by a phenyl or 
naphthalene group from the meta position of TSA's phenyl cap 
group, or from TSA's dimethyl amino group represented by 
formula (V): 
(V) [structural drawing of formula (V) not reproduced; labels: phenyl or naphthalene, n(H2C)] 




and which may bind in groove B is a 1-3 methyl group spacer 
followed by a carboxylate, sulfate or phenyl group as 
represented by formula (VI): 
(VI) [structural drawing of formula (VI) not reproduced; labels: (CH2)n, -COOH, -SO4] 



With respect to the aliphatic (Y) group, the diameter of the 
pocket suggests that one more methyl "side chain" could fit, 
in addition to the C15 methyl group on the C10 carbon. 
Nonlimiting suitable examples for Y constituents wherein Y 
comprises an aliphatic chain group are as follows: (1) add 
a methyl group to TSA on the C12 carbon (with or without a 
methyl group on the C10 carbon and with or without double 
bonds and with or without substituting the X and/or Z 
constituents of formula (I)) as represented by formula (VII): 

(VII) [structural drawing of formula (VII) not reproduced] 

(2) add a methyl group to TSA on the C9 carbon (with or 
without a methyl group on the C10 carbon; with or without 
both or either of the double bonds, and with or without 
substituting the X and/or Z constituents of formula (I)) as 
represented by formula (VIII): 

(VIII) [structural drawing of formula (VIII) not reproduced] 

(3) replace the two alkene double bonds of TSA with only 
one between C10 and C11, which may free the C11 and C12 
torsion to allow for a better fit; the X and/or Z groups may 
also be substituted as represented by formula (IX): 

(IX) [structural drawing of formula (IX) not reproduced; one of the drawings (VII)-(IX) carries the label "Branch here"] 

(4) cyclize the C15 and C12 carbons of TSA through a sulphur 
atom (or nitrogen atom); the X and/or Z groups may also be 
substituted as represented by formula (X): 

(X) [structural drawing of formula (X) not reproduced] 

(5) extend from the C9 carbon of TSA such that the extension 
approaches and/or enters groove B (see Figure 9); making C9 
sp3 so that it can have some freedom; attach to C9 a 1-3 
methyl group spacer which may include a double bond and then 
attach thereto a carboxylate, sulfate, hydroxyl 
or phenyl group which may make an interaction with the 
N-epsilon proton of His170 which may coordinate the zinc atom 
as represented by formula (XI): 
(XI) [structural drawing of formula (XI) not reproduced] 




(6) extend off the C8 carbon (replacing C14) of TSA such 
that the extension approaches or enters groove B; attach a 
1-3 methyl group spacer (which may include a double bond) 
and then link thereto a carboxylate, sulfate, hydroxyl or 
phenyl group such that an interaction is made with the 
N-epsilon proton of His170 that coordinates the zinc atom; the 
X and/or Z constituents may also be substituted as 
represented by formula (XII): 
(XII) [structural drawing of formula (XII) not reproduced; legible labels: (CH2)n, -COOH] 



(7) substitute the C8 carbon at the end of the aliphatic 
chain such that the substitution may contact groove A, A', B 
and/or C; in such an example, a cap group (X) may or may not 
be required and the X and Z constituents may be substituted 
as well, as represented by formula (XIII): 

(XIII) [structural drawing of formula (XIII) not reproduced] 

(8) formulas VII through XIII above wherein the aliphatic 
chain further comprises a methyl group between the active 
site binding group (Z) and the C8 carbon, and preferably 
just before the C8 carbon, increasing the distance between 
X and Z; (9) make the connection between the aliphatic chain 
and the cap group more rigid (e.g., by closing a 6-membered 
ring which may or may not comprise oxygen); the X and Z groups 
may also be substituted as represented by formula (XIV): 

(XIV) [structural drawing of formula (XIV) not reproduced] 

and (10) combining two or more of the changes depicted by 
formulas (VII)-(XIV). 

Additionally, nonlimiting examples for suitable Z groups 
wherein Z comprises an active site binding group are as 
follows: (1) hydroxamic acid, (2) carboxylic acid, (3) 
sulfonamide, (4) acetamide, (5) epoxyketone, (6) an ester 
with a methyl linker and a hydroxyl of acetate ester group 
to lead into the cavity and interact with a conserved 
arginine (Arg27) as represented by formula (XV): 
(XV) [structural drawing of formula (XV) not reproduced] 

and (7) an alphaketone as represented by formula (XVI): 
(XVI) [structural drawing of formula (XVI) not reproduced] 



Additionally, other suitable X, Y and Z constituents may be 
envisioned by the skilled artisan given the three- 
dimensional structural information of the present invention. 

After having determined potential suitable X, Y and Z 
constituents, the constituents are combined to form a 
compound of formula (I) using combinatorial chemistry 
techniques. This may be achieved according to U.S. Patent 
Nos. 5,608,108; 5,700,811; 5,773,474; 5,840,960 and 
5,668,179, incorporated herein by reference. Any methods 
known to one of skill in the art may be employed to 
synthesize compounds of formula (I) comprising X, Y and Z 
constituents as determined by the methods described above. 
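
The bookkeeping behind combining X, Y and Z constituents may be sketched as a simple enumeration. The following is only an illustration: the constituent lists are short text labels drawn from the examples above, the function name is hypothetical, and no chemical validity or synthetic accessibility is checked.

```python
from itertools import product

# Illustrative labels only; each list is a small subset of the examples in the text.
X_GROUPS = ["SAHA phenyl cap", "TSA dimethylamino phenyl cap", "naphthalene cap with methyl linker"]
Y_GROUPS = ["TSA aliphatic chain", "TSA chain with C12 methyl", "TSA chain with C9 methyl"]
Z_GROUPS = ["hydroxamic acid", "carboxylic acid", "sulfonamide", "acetamide", "epoxyketone"]


def enumerate_candidates(x_groups, y_groups, z_groups):
    """Return every X-Y-Z combination as a dictionary of constituent labels."""
    return [
        {"X": x, "Y": y, "Z": z}
        for x, y, z in product(x_groups, y_groups, z_groups)
    ]


candidates = enumerate_candidates(X_GROUPS, Y_GROUPS, Z_GROUPS)
```

Because the number of combinations grows as the product of the list sizes, such an enumeration also shows why computational pre-selection of candidates is useful before synthesis.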

As mentioned above, the compounds of formula (I) are useful 
to inhibit the histone deacetylase activity of HDLP and 
HDAC-related proteins. Such inhibition may allow for a 
reduction or cessation of cell growth in vitro and in vivo. 

For in vitro use, such reduction or cessation of cell growth 
is useful to study the role of histone deacetylation and 
differentiation during the cell cycle and also to study 
other mechanisms associated with cell cycle arrest, and 
particularly how the repression of transcription is involved 
in cell cycle progression, which may be studied in a yeast 
model system such as that described by Kadosh & Struhl, 
1998, Mol. Cell. Biol. 18:5121-5127. In vitro model systems 
which may be employed to study the effects of potential 
inhibitors on cell cycle progression and also tumor growth 
include those described by: Richon et al., 1998, Proc. Natl. 
Acad. Sci. USA 95:3003-3007; Yoshida et al., 1995, Bioessays 
17:423-430; Kim et al., 1999, Oncogene 18:2461-2470; Richon 
et al., 1996, Proc. Natl. Acad. Sci. USA 93:5705-5708; and 
Yoshida et al., 1987, Cancer Res. 47:3688-3691. 

For in vivo use, such a reduction or cessation of cell 
growth is useful to study the effect of said inhibitor 
compounds in non-human animal model systems of cancer and is 
also useful for the treatment of cancer in a recipient in 
need of such treatment. Non-limiting examples of animals 
which may serve as non-human animal model systems include 
mice, rats, rabbits, chickens, sheep, goats, cows, pigs, and 
non-human primates. See, e.g., Desai et al., 1999, Proc. 
AACR 40: abstract #2396; Cohen et al., 1999, Cancer Res., 
submitted. The compounds of the present invention may be 
administered to a transgenic non-human animal wherein said 
animal has developed cancer such as those animal models in 
which the animal has a propensity for developing cancer 
(e.g. animal model systems described in U.S. Patents 
5,777,193, 5,811,634, 5,709,844, 5,698,764, and 5,550,316). 
Such animal model systems may allow for the determination of 
toxicity and tumor reduction effectiveness of the compounds 
of the present invention. 

A preferred compound of the present invention may comprise 
high specific activity for HDLP and HDAC-related proteins, 
good bioavailability when administered orally, activity in 
reducing or ceasing cell growth in tumor cell lines, and 
activity in reducing or ceasing tumor growth in animal 
models of various cancers. 

Accordingly, another aspect of this invention is a method of 
eradicating or managing cancer in a recipient, which may be 
an animal and is preferably a human. Said method comprises 
administering to said recipient a tumor reducing amount of 
a compound as defined by formula (I) above, or a 
physiologically acceptable salt thereof. 

In a further aspect of the invention, there is provided a 
composition comprising the compound of formula (I) and an 
excipient or carrier. Administration of the foregoing 
agents may be local or systemic. Such carriers include any 
suitable physiological solutions or dispersant or the like. 
The physiologic solutions include any acceptable solution or 
dispersion media, such as saline, or buffered saline. The 
carrier may also include antibacterial and antifungal 
agents, isotonic and absorption delaying agents, and the 
like. Except insofar as any conventional media, carrier or 
agent is incompatible with the active ingredient, its use in 
the compositions is contemplated. 

Routes of administration for the compositions containing the 
delivery vehicle constructs of the present invention include 
any conventional and physiologically acceptable routes, such 
as, for example, oral, pulmonary, parenteral (intramuscular, 
intraperitoneal, intravenous (IV) or subcutaneous 
injection) , inhalation (via a fine powder formulation or a 
fine mist), transdermal, nasal, vaginal, rectal, or 
sublingual routes of administration and can be formulated in 
dosage forms appropriate for each route of administration. 

The following examples are provided to more clearly 
illustrate the aspects of the invention and are not intended 
to limit the scope of the invention. 

EXAMPLES 

Example 1: Protein Production and Purification: 
Full-length wild-type HDLP (Genbank accession number 
AE000719) was subcloned from an Aquifex aeolicus chromosomal 
DNA preparation (provided by Robert Huber of the University of 
Regensburg, Germany) into the pGEX4T3 (Amersham-Pharmacia, 
Piscataway, NJ) vector using the polymerase chain reaction 
(PCR). The cysteine-to-serine and active site mutants were 
constructed by PCR site-directed mutagenesis and were 
sequenced. The HDLP-glutathione S-transferase (GST) fusion 
protein was produced in Escherichia coli, purified by 
affinity chromatography using a column of 
glutathione-sepharose resin (Amersham-Pharmacia, Piscataway, 
NJ), and by anion-exchange chromatography (Q-sepharose™; 
Amersham-Pharmacia, Piscataway, NJ). HDLP was cleaved from 
the fusion protein with thrombin at 4° C, was purified by 
anion-exchange (Q-sepharose™; Amersham-Pharmacia, 
Piscataway, NJ) and gel filtration chromatography 
(Superdex™200; Amersham-Pharmacia, Piscataway, NJ), and was 
concentrated to typically 25 mg/ml in a buffer of 25 mM 
bis-tris propane (BTP), 500 mM NaCl, 5 mM dithiothreitol 
(DTT), 2% isopropanol, pH 7.0. 

Although it is not known what metal cofactor HDLP contains 
in vivo, it is presumed to be zinc because of the 
arrangement of the ligands and the similarities in the 
active site to the zinc proteases. The lack of metal in the 
purified HDLP is presumed due, in part, to the use of DTT 
during purification. HDLP was reconstituted with Zn2+ by 
mixing the Cys75Ser/Cys77Ser double mutant at 10 mg/ml with 
a 5-fold molar excess of ZnCl2 in a buffer of 25 mM bis-tris 
propane, 200 mM NaCl, 1% isopropanol, pH 7.0. Unbound ZnCl2 
was removed by fractionating HDLP through a G25 desalting 
column (Amersham-Pharmacia, Piscataway, NJ). The 
HDLP-Zn2+-TSA complex was prepared by incubating the 
Zn2+-reconstituted HDLP mutant with 1 mM TSA for 45 minutes, 
followed by gel filtration chromatography (Superdex™200; 
Amersham-Pharmacia, Piscataway, NJ) to remove excess TSA, 
and concentration to typically 25 mg/ml in a buffer of 25 mM 
bis-tris propane, 500 mM NaCl, 1% isopropanol, pH 7.0. 

FLAG epitope tagged human HDAC1 was overexpressed using a 
baculovirus expression system in Hi5 (Invitrogen, Carlsbad, 
CA) insect cells grown in suspension in serum-free media 
(Sf900, Gibco, Grand Island, NY). The fusion protein was 
purified by anion exchange and affinity chromatography using 
Anti-FLAG M2 affinity resin (Sigma, St. Louis, MO) and FLAG 
Peptide (Sigma, St. Louis, MO). 



Example 2: Crystallization and data collection: 
Crystals of apo-HDLP were grown at room temperature by the 
hanging-drop vapor-diffusion method, from 7.5% isopropanol, 
28% PEG 1500, 425 mM NaCl, 100 mM Tris-Cl, pH 7.0. They 
form in space group C2 with a = 51.4 Å, b = 93.8 Å, c = 78.7 
Å, β = 96.9°, and contain one HDLP molecule in the 
asymmetric unit. Diffraction data were collected with 
crystals flash-frozen in a buffer of 7.5% isopropanol, 35% 
PEG 1500, 75 mM NaCl, 100 mM Tris-Cl, pH 8.0, at -170° C. 
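
As a worked illustration of the cell dimensions reported above for the apo-HDLP crystals, the unit cell volume of a monoclinic cell follows from V = a·b·c·sin(β). The sketch below takes the monoclinic angle as β = 96.9 degrees (an assumption about the reported value, since the angle is the only quantity of the four that is not a length); the function name is hypothetical.

```python
import math


def monoclinic_cell_volume(a, b, c, beta_deg):
    """Volume (cubic angstroms) of a monoclinic unit cell.

    a, b, c are cell edges in angstroms; beta_deg is the angle between
    a and c in degrees (alpha and gamma are 90 degrees by definition).
    """
    return a * b * c * math.sin(math.radians(beta_deg))


# Cell dimensions of the apo-HDLP C2 crystal form as reported in the text.
apo_volume = monoclinic_cell_volume(51.4, 93.8, 78.7, 96.9)
```

The resulting volume, together with the number of molecules per cell implied by the space group, is the usual starting point for estimating solvent content.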

The structure of the HDLP-Zn2+ complex was determined from 
HDLP Cys75Ser/Cys77Ser double mutant crystals grown from 23% 
tert-butanol, 27% PEG 1500, 400 mM KCl, 100 mM bis-tris 
propane-Cl, pH 6.8. Space group and cell dimensions were 
identical to the apo crystals. The HDLP-Zn2+ crystals were 
harvested and frozen in 27% tert-butanol, 22% PEG 1500, 50 
mM KCl, 20 mM NaCl, 0.2 mM ZnCl2, 100 mM bis-tris propane, 
pH 6.8, at -170° C. 



Crystals of the HDLP-Zn2+-TSA complex comprised HDLP 
Cys75Ser/Cys77Ser double mutant and were grown from 23% 
tert-butanol, 27% PEG 1500, 600 mM KCl, 100 mM bis-tris 
propane-Cl, pH 6.8, by microseeding. The crystals were 
grown in the presence of zinc. They form in space group 
P212121 with a = 53.4 Å, b = 94.4 Å, c = 156.3 Å and contain 
two HDLP-Zn2+-TSA complexes in the asymmetric unit. The 
HDLP-Zn2+-TSA crystals were harvested and frozen in the same 
cryobuffer as the HDLP-Zn2+ crystals except that 0.5 mM TSA 
was added. Data were processed with DENZO and SCALEPACK 
(Otwinowski & Minor, 1997, Methods Enzymol. 276:307-326). 

MIR analysis, model building and refinement. 

The HDLP-Zn2+-SAHA complex crystals were grown and evaluated 
in the same way as the HDLP-Zn2+-TSA crystals. However, the 
restraints for the SAHA structure were constructed based on 
stereochemical parameters from TSA. Like the apo-HDLP 
crystals, the SAHA/HDLP co-crystals grew in space group C2. 

Heavy-atom soaks were performed with the apo-HDLP crystals 
in a buffer of 7.5% isopropanol, 30% PEG 1500, 75 mM NaCl, 
100 mM Tris-Cl, pH 8.0, supplemented with 1.0 mM thimerosal 
for 2 h, 5 mM KAu(CN)2 for 1 h, and 1 mM Pb(Me)3OAc for 2 h. 
MIR phases were calculated with the program MLPHARE (The 
CCP4 suite: Programs for computational crystallography, 
1994, Acta Crystallogr. D 50:760-763) at 2.5 Å using the 
anomalous diffraction signal from the thimerosal derivative, 
and had a mean figure of merit of 0.55. The phases were 
improved by solvent flattening with the program DM (The CCP4 
suite: Programs for computational crystallography, 1994, 
Acta Crystallogr. D 50:760-763), and were used to build 
the initial model with the program O (Jones et al., 1991, 
Acta Crystallogr. A 47:110-119). Successive rounds of 
rebuilding and simulated annealing refinement with the 
program CNS (Brunger et al., 1998, Acta Crystallogr. D 
54:905-921) allowed interpretation of HDLP from residues 2 
to 373. Residues 1, 374, and 375 were not modeled and are 
presumed to be disordered. 

The structures of the HDLP-Zn2+-TSA and HDLP-Zn2+-SAHA complexes 
were determined by molecular replacement with the program 
AMORE (The CCP4 suite: Programs for computational 
crystallography, 1994, Acta Crystallogr. D 50:760-763) using 
the apo-HDLP structure as a search model. The initial 
electron density maps had strong and continuous difference 
density for the entire TSA molecule. However, the SAHA 
molecule was not as well ordered in the cap group region. 
The structure of TSA was obtained from the Cambridge 
Structural Database (Refcode TRCHST) and was used to define 
stereochemical restraints used in the refinement with the 
program CNS. The restraints of SAHA were constructed based 
on stereochemical parameters from TSA and surrounding amino 
acid residues. The dimer interface in the HDLP-Zn2+-TSA and 
HDLP-Zn2+-SAHA crystals primarily involves Phe200 on the 
protein surface. The Phe200 side chain contacts Tyr91, 
whose side chain conformation changes on TSA binding, and 
part of the dimethyl amino phenyl group of TSA from the 
second protomer. The HDAC family does not contain a 
phenylalanine residue at the equivalent position. 

Example 3: Histone deacetylase assays: 

Purified proteins were assayed by incubating 10 of 
[3H]acetyl-labeled murine erythroleukemia histone substrate 
and HDAC assay buffer (20 mM Tris-HCl, pH 8.0, 150 mM NaCl, 
10% glycerol) for 30-60 minutes at 37° C in a total volume 
of 30 μl. The final concentrations of HDLP and HDAC1-FLAG 
were 3.6 μM and 0.24 μM, respectively. Assays were 
performed in duplicate. The reactions were stopped and the 
released acetate was extracted and assayed as described 
(Hendzel et al., 1991, J. Biol. Chem. 266:21936-21942). 
[3H]acetyl-labeled murine erythroleukemia histones were 
prepared essentially as described (Carmen et al., 1996, J. 
Biol. Chem. 271:15837-15844). Inhibitors were added in the 
absence of substrate and incubated on ice for 20 minutes; 
substrate was then added and the assay performed as described 
above. HDLP was incubated with 20 μM ZnCl2 and 20 μM 
MnCl2(H2O)4 in HDAC buffer and tested for activity. 

Only HDLP dialyzed against ZnCl2 had activity. HDAC1-FLAG 
was dialyzed against 20 μM ZnCl2 in HDAC buffer, which had no 
effect on activity. Therefore, HDAC1-FLAG contains a metal 
as purified. 

The in vivo substrate of HDLP is not known. HDLP may have 
a role in acetoin utilization like the B. subtilis AcuC gene 
product, and it has been annotated as such in the genome 
sequence, but the reaction catalyzed by AcuC is also not 
known. Furthermore, the A. aeolicus genome appears to lack 
the acuA and acuB genes that are part of the acuABC operon 
of B. subtilis (Deckert et al., 1998, Nature 392:353-358), 
and HDLP is as similar to human HDAC1 (35.2% identity) as 
it is to B. subtilis AcuC (34.7% identity). 
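
For context, a percent identity figure of the kind quoted above is typically computed from a pairwise alignment. The sketch below shows one common convention (identical positions divided by aligned, non-gap positions); the text does not state which convention or alignment program was used, so the function and its denominator are assumptions for illustration.

```python
def percent_identity(aligned_a, aligned_b, gap="-"):
    """Percent identity over the non-gap columns of two aligned sequences.

    Both inputs must already be aligned and therefore of equal length.
    """
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    compared = 0
    matches = 0
    for residue_a, residue_b in zip(aligned_a, aligned_b):
        if residue_a == gap or residue_b == gap:
            continue  # skip columns where either sequence has a gap
        compared += 1
        if residue_a == residue_b:
            matches += 1
    if compared == 0:
        raise ValueError("no aligned positions to compare")
    return 100.0 * matches / compared
```

Other conventions divide by the length of the shorter sequence or by the full alignment length, which gives slightly different percentages for the same alignment.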

What is claimed is: 

1. A crystal of an enzyme comprising deacetylase activity 
wherein said crystal effectively diffracts X-rays for 
the determination of the atomic coordinates of said 
enzyme to a resolution of greater than 4 Å and wherein 
the structure of said enzyme comprises a conserved 
core α/β structure characteristic fold wherein said 
conserved α/β fold comprises an eight-stranded parallel 
β sheet and eight α helices and wherein four of the 
helices pack on either face of said parallel β sheet 
and wherein said structure of said enzyme comprises an 
rmsd of less than or equal to 1.5 Å in the positions of 
Cα atoms for at least 2/3 or more of the amino acids of 
HDLP as defined by the atomic coordinates of HDLP. 

2. The crystal of claim 1, wherein said protein structure 
further comprises: 

(a) eight α helices positioned near one side of the β 
sheet; and 

(b) at least seven large, well defined loops 
originating from the C-terminal ends of the β-strands 
of said eight-stranded parallel β sheet 
wherein the eight extra helices and the seven 
large loops are associated with a significant 
extension of the structure beyond the core α/β 
motif and wherein said extension of the structure 
gives rise to a deep, narrow pocket and an 
internal cavity adjacent to the pocket. 

30 

3. The crystal of claim 1, wherein said enzyme comprising 
deacetylase activity is selected from the group 
consisting of HDLP, HDLP-related proteins, HDAC1, 
HDAC2, HDAC3, HDAC4, HDAC5, HDAC6, HDAC-related 
proteins, APAH, AcuC, and functional derivatives 
thereof. 

4. The crystal of claim 2 further comprising a 
specifically bound zinc atom in the active site of said 
enzyme. 

5. The crystal of claim 2 further comprising a 
specifically bound deacetylase inhibitor compound in 
the active site of said enzyme. 

6. The crystal of claim 2 defined by the atomic coordinates 
according to Figure 16. 

7. A method for identifying a potential deacetylase 
inhibitor compound for an enzyme which comprises 
deacetylase activity, said method comprising the steps 
of: 

a. using a three-dimensional structure of HDLP as 
defined by atomic coordinates according to Figure 
16; 

b. employing said three-dimensional structure to 
design or select said potential inhibitor; 

c. synthesizing said potential inhibitor; 

d. contacting said potential inhibitor with said 
enzyme in the presence of an acetylated substrate; 
and 

e. determining the deacetylase inhibitory activity of 
said potential inhibitor. 

8. The method of claim 7, wherein the three-dimensional 
structure is designed or selected using computer 
modeling. 

9. The method of claim 7, wherein the potential 
deacetylase inhibitor is designed de novo. 

10. The method of claim 7, wherein the potential 
deacetylase inhibitor is designed based on a known 
inhibitor. 

11. The method of claim 7, wherein said enzyme comprising 
deacetylase activity is selected from the group 
consisting of HDLP, HDLP-related proteins, HDAC1, 
HDAC2, HDAC3, HDAC4, HDAC5, HDAC6, HDAC-related 
proteins, APAH, and AcuC. 

12. A method of evaluating the binding properties of the 
potential deacetylase inhibitor compound comprising the 
steps of: 

a. co-crystallizing said compound with HDLP; 

b. determining the three-dimensional structure of 
said HDLP-potential inhibitor complex co-crystal 
by molecular replacement using the three-dimensional 
structure of HDLP as defined by atomic 
coordinates according to Figure 16; and 

c. analyzing said three-dimensional structure of said 
HDLP bound to said potential inhibitor compound to 
evaluate the binding characteristics of said 
potential inhibitor compound. 



13. A method for solving the structure of an HDAC family 
member crystal comprising the steps of: 

a. collecting X-ray diffraction data of said crystal 
wherein said data diffracts to a high resolution 
limit of greater than 4 Å; 

b. using the atomic coordinates of HDLP according to 
Figure 16 to perform molecular replacement or 
refinement and difference Fourier with said X-ray 
diffraction data of said HDAC family member 
crystal to determine the structure of said HDAC 
family member; and 

c. refining said structure of said HDAC family 
member. 

14. The method of claim 13, wherein said HDAC family member 
is HDAC1. 

15. A Cys75Ser/Cys77Ser double mutant of HDLP wherein said 
mutant is encoded by the nucleic acid sequence of SEQ 
ID NO: 4. 

16. A Cys75Ser/Cys77Ser double mutant of HDLP wherein said 
mutant has the amino acid sequence of SEQ ID NO: 3. 

17. A nucleotide sequence according to SEQ ID NO: 4. 

18. An expression vector comprising the nucleotide sequence 
of claim 17. 

19. A method of using the crystal of claim 1 for screening 
for a novel drug comprising: 

a. selecting a potential ligand by performing 
rational drug design with the three-dimensional 
structure determined for the crystal; 

b. contacting the potential ligand with the ligand 
binding domain of the crystal; and 

c. detecting the binding potential of the potential 
ligand for the ligand binding domain, wherein the 
novel drug is selected based on its having a 
greater affinity for the ligand binding domain 
than that of a known drug. 

Figure 4 
Figure 4d 
Figure 6 
Figure 9 
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Figure 1U 

10 20 30 40 

, | 1 1 1 r ■ i i 1 ■ i i i 1 1 i 1 i 1 " " 1 u > ■ I ' ■ i i I " " I 

ATGAAGAAOGTLTAAACTTATCGGAACTITA^ 40 

ACTACAGATATCXXAAAAAOS^^ 80 

ACTTICCCTACTCCTTA^ 120 
ATAGATCAGAAGGAATTAATCAAGAGCAGACCCGCAACTA 160 

AftGAAGAACTCCTITrATI^^ 200 

210 220 230 240 

1 1 , • i ■ ■ i i 1 I .. ....... I ... ■ I ■ i i i I i 1 1 i 1 

CACITIAATQGAAGD3GAA 240 

QSAGCTAQQGAAAACTAC^ 280 

COCTATCTrACGaSATGri^ 320 

QgGTPICAACfiGfI)QCAQQOGATftGR^ 360 

AMCTAQCTTICAMCCOSQQQGAQCT 400 

410 . 420 430 440 

i ■ ■ ■ i . ■ . i I i • • • ' i i 1 1 1 i ■ 1 1 1 1 ■ 1 1 1 ■ ■ 1 1 1 1 1 1 1 I 

Q33Xin3QGAAITGACTA 480 

AGAATAITCTACAIAGACCITSA^ 520 

(jlXnTCMGAAGCXTITrADGATACA^ 560 

CCTQjXXXTICAOCAGTC^^ 600 

610 620 630 640 

! . . . I ■ ■ • ■ I ■ ■ ■ ■ 1 I I I I 1 I ' ■ 1 1 ■ ' 1 1 I ' ' 1 1 I 1 1 1 1 I 

GAGAAQQGCTIXXTGGAOGAGATAQGAGAAGGAAAAGGAA 640 

AQ3QCTACAACCTGAACAT^^ 680 

Q^o^ACGAGirCCTCITIQCXXTA^ 720 

ATAGTCAAAGAAGrTATITCAGCCCXSAQCTIT^ 760 

£ ACI 033AACTGAaXACB^^ 800 

810 820 830 840 


(jn£AACCTCnCAAACCT 840 

ATOGTITCXnGAGGTTITCQQGGAGQGAG?I^^ 880 

GAQGCQGATACCATCCTTAO^ 920 

CCTAATC7ianXKXSAGCTITCX3aGAA^ 960 

AAQCTAAACAATAAAGCAAAAGAQCTIT^ 1000 

1010 1020 1030 1040 


(XTCGAAACCCTAAAQGACCX^ 1080 
^3GAAAGAAGTAAAGGATiOXTIGAAAAGQQGAAAGCCT 1120 
CATCTIA 1127 
Figure 11 

10 20 30 40 
MKKVKLICTIIJYGKYRYPKNH^ 40 
rnp ypr ,tk , . c ?p p ate trft ,t ,t .f htfttvtkttt MRATTRrnr\/PK 80 
GAREKYOTGGYENFVSY^ 120 
NVAF^AQGJyiHHAFKSRANGF^^ 160 
RILYIDLIiAHHCDGVQEAF^^ 200 

210 220 230 240 


Fpnp r FFTC fi^csfCffW NTPT ,PK^T ^ ^PMEIFT F AT .tot.ct k 240 
iyKEVFEPEVYLLQLGrDFr ,T iFTIYLSKFMjSNVMTLKAFN 280 
I^EVP5EGVYD3G3grt^^ 320 
KLMNKAKFT J ,KSID^ l E^^'iJDEVr^SYMLE^XKDPVM^GEl'• 360 
RKEVKDTLEKAKASS 375 
10 20 30 40 
ATGAAGAAGGn7TAAACTIM03 40 
MJTMZ£GATATOXAAAAAQCATO 80 
AGmraTTACTOTITA^^ 120 
ATASATCAGAAGGAATTAAT^^ 160 
AAGAAGAACICXHTITATia^ 200 

210 220 230 240 
CACTITAATQG&AGOGGA^^ 240 
QGMXTTAGQGAAA^AC&ACA^^ 280 
(XGTTATCTITACSCGMtOT 320 
033TICAACAJD3CA3QOGATA2AQ 360 
AAIXJEOTITICAATCXX^ 400 

410 420 430 440 


rACATAAACGACCC 440 

HTIAAG 480 

AGAATACICTACATAGAiXTIGA^ 520 
GTIGITCAQGAAGCLTlTr 560 

CCTcnrarrioccA(3ia^ 600 

610 620 630 640 

GAGAAGQ3CTIUria^ 640 

CGACAACGAGTtTCX^ 720 
ATAGTCAAAGAAGTTATTI^^ 760 
AACTCGGAACIGAiXCACinriT^ 800 

810 820 830 840 

GITCAACCTCTCAAAa?^^ 840 
ATCXnTaJTCAGGITITCXm 880 

a?TAATCT3Gri«3^^ 960 
AAXTAAACAATAAAXAAAAGAGCT 1000 

1010 1020 1030 1040 

ATITIGAAGAGTTIGAaSAQG 1040 
GCTD3AAACXXnrAAAQ2AC^^ 1080 
AQGAAA3AAGTAAAQGATACQC^^ 1120 
CAICTTA 1127 
Figure 13 

10 20 30 40 
HMSELIKSRPATKEEII^^ 80 

T^YIIXJDAHHCDOTEAF^^ 200 

210 220 230 240 

EKGFIJEEIGEGKGKCT 240 
IVKEVFEFEVYLLQDGro 280 
IVREAM3EGVYL333G^^ 320 
yUvTTqra vTTT J ^.qTDFEEFmEVDRSY^ILEIli<DPWRQGEV• 360 

RKEVKDTLEKAKASS 375 
Figure 14 
40 



AIGAAGAAGGTTAAACTTM03G^ 40 

AGTEACAGATAIUXAAAAACC^ 80 

AGrTTTCCCTiCTCXrTTAQGrr^ 120 

ATAGAIGAGAAQGAATTAAICAAGA^ 160 

M^PGP^nixxrn^ 200 

210 220 230 240 


CACTirAATOGAAGaSGAAA^^ 240 

GGAGCTAGOGAAAACTACAAl^ 280 

OCXJCOCIT/OSCKMGTITA^ 320 

QC33nOV^CAGriQCaGGQGAIAG 360 

AATCTAQCnTICAATCCaQQQ3GfiGCT 400 



TTPJ^M3CN3GOOJ^J^ 440 

CX3CnCTG3GAATIGACJrA^ 480 

AGAATACTCTACATAGACXnT^^ 520 

GTUITCAQGAAQCXTITITACX^ 560 

CXTCIUXTTCAO^KJIttX^^ 600 

610 620 630 640 

GAGAAQGGCTTCCTOGAQ^ 640 

A33GCTA2AACCTCAACAFIOD^ 680 

OGACAACGACTTOC^CITroCX^ 720 

ATAGTICAAAGAAGTIATITCAGC^^ 760 

AACia3GAACIGAODCACTC^^ 800 

810 820 830 840 

CTICAftCCTCTCAAACGTTC ^ 840 

ATCGTIT03TClAGGTITia3GGGAQQGACT 880 

GAGGOQGATACCATOTITACmX^ 920 

CCTAAICiraJIQDGAGCTIT^ 960 

AAQCTAAACAATAAAGCAAAAG^ 1000 

1010 1020 1030 1040 

ACTTIGAAGAGITIGACG^^ 1040 

(XnnSAAAOCCTAAAQGAO^^ 1080 

AGGAAAGAAGHAAAQGATAOQCT^ 1120 
CATCTIA 1127 



410 420 



430 440 
Figure 15 

10 20 30 40 
MKKVKLIGIliDYGK^ 40 

lEEKELIKSRPATKEE^ 80 

GAREK^GGYENWSYaM^^ 120 

NVaFNPMGMHHAFKSRAN^^ 160 

RILYIIXimcrX^EAFYDITDQV^ 200 

210 220 230 240 

ekgfleeigeq<;gkct 240 

IVKEVFEPEVYLIJ^^ 280 
:n/REVI:) 2EX^^ 320 

RKEVKDTLEKAKASS 375 
Figure 16 

Residue # X Y Z OCC. B Segment ID 

[Atomic coordinate table: atoms 1 to 66, residues ALA 2 through ASP 11, segment AAAA.] 


WO 01/18045 

PCT/US00/24700 

20/263 
Figure 16-14 
Figure 16- 
Figure 17-1 
Figure 17-8 
Figure 17-9 
Figure 17-10 
Figure 17-20 
Figure 17-21 






ATOM 1321 CD1 LEU A 167 24.741 34.647 65.552 1.00 26.24 6 
ATOM 1322 CD2 LEU A 167 23.225 32.661 65.302 1.00 23.45 6 
ATOM 1323 C LEU A 167 23.162 33.660 60.951 1.00 26.37 6 
ATOM 1324 O LEU A 167 22.386 34.613 60.971 1.00 25.95 8 
ATOM 1325 N ASP A 168 23.726 33.208 59.828 1.00 29.66 7 
ATOM 1326 CA ASP A 168 23.410 33.787 58.520 1.00 28.35 6 
ATOM 1327 CB ASP A 168 24.057 32.987 57.290 1.00 33.29 6 
ATOM 1328 CG ASP A 168 23.937 33.676 56.037 1.00 35.38 6 
ATOM 1329 OD1 ASP A 168 24.892 34.388 55.659 1.00 39.48 8 
ATOM 1330 OD2 ASP A 168 22.893 33.531 55.364 1.00 33.40 8 
ATOM 1331 C ASP A 168 21.906 33.614 58.408 1.00 28.74 6 
ATOM 1332 O ASP A 168 21.354 32.648 58.948 1.00 26.21 8 
ATOM 1333 N ALA A 169 21.239 34.524 57.711 1.00 26.16 7 
ATOM 1334 CA ALA A 169 19.793 34.415 57.579 1.00 24.39 6 
ATOM 1335 CB ALA A 169 19.233 35.640 56.879 1.00 22.75 6 
ATOM 1336 C ALA A 169 19.420 33.157 56.813 1.00 24.37 6 
ATOM 1337 O ALA A 169 18.266 32.752 56.824 1.00 22.34 8 
ATOM 1338 N HIS A 170 20.405 32.542 56.156 1.00 25.78 7 
ATOM 1339 CA HIS A 170 20.180 31.327 55.375 1.00 25.20 6 
ATOM 1340 CB HIS A 170 20.667 31.501 53.936 1.00 25.76 6 
ATOM 1341 CG HIS A 170 20.122 32.711 53.245 1.00 29.08 6 
ATOM 1342 CD2 HIS A 170 19.338 32.834 52.147 1.00 30.59 6 
ATOM 1343 ND1 HIS A 170 20.384 33.995 53.675 1.00 30.77 7 
ATOM 1344 CE1 HIS A 170 19.784 34.858 52.873 1.00 29.07 6 
ATOM 1345 NE2 HIS A 170 19.143 34.180 51.939 1.00 32.19 7 
ATOM 1346 C HIS A 170 20.895 30.113 55.958 1.00 26.00 6 
ATOM 1347 O HIS A 170 21.913 30.234 56.637 1.00 25.76 8 
ATOM 1348 N HIS A 171 20.349 28.939 55.658 1.00 27.29 7 
ATOM 1349 CA HIS A 171 20.893 27.655 56.090 1.00 25.01 6 
ATOM 1350 CB HIS A 171 19.934 26.532 55.663 1.00 24.93 6 
ATOM 1351 CG HIS A 171 20.468 25.148 55.889 1.00 26.56 6 
ATOM 1352 CD2 HIS A 171 20.674 24.123 55.028 1.00 22.34 6 
ATOM 1353 ND1 HIS A 171 20.823 24.678 57.137 1.00 25.35 7 
ATOM 1354 CE1 HIS A 171 21.222 23.424 57.036 1.00 22.68 6 
ATOM 1355 NE2 HIS A 171 21.140 23.062 55.767 1.00 24.13 7 
ATOM 1356 C HIS A 171 22.267 27.413 55.471 1.00 24.74 6 
ATOM 1357 O HIS A 171 22.540 27.863 54.356 1.00 28.22 8 
ATOM 1358 N CYS A 172 23.131 26.705 56.190 1.00 23.03 7 
ATOM 1359 CA CYS A 172 24.467 26.389 55.683 1.00 23.41 6 
ATOM 1360 CB CYS A 172 25.497 26.474 56.812 1.00 19.31 6 
ATOM 1361 SG CYS A 172 25.005 25.631 58.318 1.00 16.78 16 
ATOM 1362 C CYS A 172 24.484 24.997 55.048 1.00 25.45 6 
ATOM 1363 O CYS A 172 25.203 24.098 55.483 1.00 24.47 8 
ATOM 1364 N ASP A 173 23.664 24.839 54.015 1.00 26.67 7 
ATOM 1365 CA ASP A 173 23.542 23.593 53.269 1.00 26.47 6 
ATOM 1366 CB ASP A 173 22.735 23.857 51.993 1.00 26.33 6 
ATOM 1367 CG ASP A 173 23.281 25.030 51.179 1.00 27.06 6 
ATOM 1368 OD1 ASP A 173 22.539 25.558 50.330 1.00 23.43 8 
ATOM 1369 OD2 ASP A 173 24.454 25.417 51.372 1.00 29.38 8 
ATOM 1370 C ASP A 173 24.872 22.932 52.922 1.00 26.65 6 
ATOM 1371 O ASP A 173 24.940 21.708 52.784 1.00 28.38 8 
ATOM 1372 N GLY A 174 25.926 23.737 52.793 1.00 25.24 7 
ATOM 1373 CA GLY A 174 27.227 23.198 52.447 1.00 23.11 6 
ATOM 1374 C GLY A 174 27.896 22.505 53.612 1.00 25.64 6 
ATOM 1375 O GLY A 174 28.443 21.408 53.462 1.00 27.67 8 
ATOM 1376 N VAL A 175 27.848 23.144 54.778 1.00 24.29 7 
ATOM 1377 CA VAL A 175 28.459 22.602 55.989 1.00 22.20 6 
ATOM 1378 CB VAL A 175 28.536 23.672 57.101 1.00 20.15 6 
ATOM 1379 CG1 VAL A 175 29.449 23.192 58.218 1.00 20.11 6 
ATOM 1380 CG2 VAL A 175 29.015 24.989 56.530 1.00 18.74 6 
ATOM 1381 C VAL A 175 27.647 21.409 56.505 1.00 22.85 6 
ATOM 1382 O VAL A 175 28.173 20.512 57.173 1.00 20.07 8 
ATOM 1383 N GLN A 176 26.356 21.404 56.203 1.00 24.12 7 
ATOM 1384 CA GLN A 176 25.518 20.303 56.629 1.00 27.18 6 
ATOM 1385 CB GLN A 176 24.045 20.611 56.355 1.00 32.86 6 
ATOM 1386 CG GLN A 176 23.084 19.483 56.726 1.00 36.04 6 
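
The final column of the coordinate records in this figure takes the values 6, 7, 8 and 16, which is consistent with the atomic number of each atom (carbon, nitrogen, oxygen, sulfur). The sketch below checks that reading for a few records. It assumes that the element symbol is the first letter of the atom name, which holds for the atom names appearing here but is not a general rule for coordinate files; the helper name is hypothetical.

```python
# Atomic numbers for the elements that occur in the protein atoms listed here.
ATOMIC_NUMBER = {"C": 6, "N": 7, "O": 8, "S": 16}


def element_matches(atom_name: str, atomic_number: int) -> bool:
    """Check that an atom name (e.g. 'CD1', 'SG') agrees with an atomic number.

    Assumption: the element symbol is the first letter of the atom name.
    """
    return ATOMIC_NUMBER.get(atom_name[0]) == atomic_number


# (atom name, last column) pairs for records 1321, 1324, 1325 and 1361.
records = [("CD1", 6), ("O", 8), ("N", 7), ("SG", 16)]
print(all(element_matches(name, z) for name, z in records))
```

A check of this kind is also a convenient way to confirm that the atom-name column and the last column have been aligned row for row.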




Figure 17-22 
Figure 17-40 
Figure 17-42 
Figure 18-1 

Residue # X Y Z B Segment ID 

ATOM 1 CB ALA A 2 46.726 14.971 138.208 1.00 56.80 
ATOM 2 C ALA A 2 47.943 12.813 138.561 1.00 58.93 
ATOM 3 O ALA A 2 48.857 13.292 137.884 1.00 60.99 
ATOM 4 N ALA A 2 46.995 14.046 140.488 1.00 56.88 
ATOM 5 CA ALA A 2 46.801 13.697 139.052 1.00 59.41 
ATOM 6 N LYS A 3 47.890 11.525 138.903 1.00 53.81 
ATOM 7 CA LYS A 3 48.937 10.591 138.492 1.00 53.62 
ATOM 8 CB LYS A 3 48.736 9.229 139.156 1.00 50.26 
ATOM 9 CG LYS A 3 48.917 9.279 140.665 1.00 56.64 
ATOM 10 CD LYS A 3 48.950 7.891 141.285 1.00 57.18 
ATOM 11 CE LYS A 3 49.160 7.964 142.796 1.00 56.74 
ATOM 12 NZ LYS A 3 50.423 8.663 143.165 1.00 54.86 
ATOM 13 C LYS A 3 49.063 10.430 136.986 1.00 49.95 
ATOM 14 O LYS A 3 48.088 10.562 136.248 1.00 44.34 
ATOM 15 N VAL A 4 50.287 10.147 136.550 1.00 46.01 
ATOM 16 CA VAL A 4 50.609 9.985 135.142 1.00 42.48 
ATOM 17 CB VAL A 4 51.901 10.755 134.809 1.00 43.42 
ATOM 18 CG1 VAL A 4 52.179 10.713 133.307 1.00 39.20 
ATOM 19 CG2 VAL A 4 51.773 12.186 135.310 1.00 39.34 
ATOM 20 C VAL A 4 50.787 8.510 134.806 1.00 38.41 
ATOM 21 O VAL A 4 51.659 7.839 135.351 1.00 37.08 
ATOM 22 N LYS A 5 49.959 8.011 133.899 1.00 37.79 
ATOM 23 CA LYS A 5 50.016 6.610 133.515 1.00 38.17 
ATOM 24 CB LYS A 5 48.700 5.915 133.887 1.00 38.40 
ATOM 25 CG LYS A 5 48.411 5.803 135.385 1.00 42.84 
ATOM 26 CD LYS A 5 49.384 4.855 136.070 1.00 44.10 
ATOM 27 CE LYS A 5 49.017 4.632 137.534 1.00 45.97 
ATOM 28 NZ LYS A 5 49.045 5.894 138.322 1.00 51.78 
ATOM 29 C LYS A 5 50.275 6.392 132.030 1.00 38.31 
ATOM 30 O LYS A 5 49.992 7.253 131.201 1.00 38.13 
ATOM 31 N LEU A 6 50.817 5.220 131.717 1.00 35.05 
ATOM 32 CA LEU A 6 51.082 4.818 130.346 1.00 31.46 
ATOM 33 CB LEU A 6 52.582 4.592 130.133 1.00 28.46 
ATOM 34 CG LEU A 6 53.094 4.256 128.720 1.00 30.91 
ATOM 35 CD1 LEU A 6 52.618 2.884 128.295 1.00 33.05 
ATOM 36 CD2 LEU A 6 52.630 5.312 127.744 1.00 21.96 
ATOM 37 C LEU A 6 50.307 3.512 130.164 1.00 30.50 
ATOM 38 O LEU A 6 50.453 2.581 130.955 1.00 32.82 
ATOM 39 N ILE A 7 49.459 3.456 129.145 1.00 26.94 
ATOM 40 CA ILE A 7 48.676 2.255 128.893 1.00 28.29 
ATOM 41 CB ILE A 7 47.218 2.598 128.493 1.00 28.94 
ATOM 42 CG2 ILE A 7 46.499 1.343 128.041 1.00 32.57 
ATOM 43 CG1 ILE A 7 46.447 3.172 129.688 1.00 36.59 
ATOM 44 CD1 ILE A 7 46.979 4.468 130.236 1.00 46.80 
ATOM 45 C ILE A 7 49.341 1.470 127.770 1.00 31.09 
ATOM 46 O ILE A 7 49.600 2.009 126.695 1.00 27.65 
ATOM 47 N GLY A 8 49.638 0.201 128.029 1.00 27.30 
ATOM 48 CA GLY A 8 50.277 -0.614 127.016 1.00 25.50 
ATOM 49 C GLY A 8 50.578 -2.024 127.480 1.00 30.66 
ATOM 50 O GLY A 8 50.224 -2.421 128.592 1.00 30.02 
ATOM 51 N THR A 9 51.238 -2.777 126.611 1.00 28.94 
ATOM 52 CA THR A 9 51.614 -4.156 126.877 1.00 33.63 
ATOM 53 CB THR A 9 50.393 -5.083 126.857 1.00 36.19 
ATOM 54 OG1 THR A 9 50.827 -6.441 126.992 1.00 34.87 
ATOM 55 CG2 THR A 9 49.633 -4.931 125.548 1.00 36.49 
ATOM 56 C THR A 9 52.567 -4.637 125.794 1.00 34.83 
ATOM 57 O THR A 9 52.545 -4.133 124.677 1.00 36.91 
ATOM 58 N LEU A 10 53.407 -5.609 126.129 1.00 39.15 
ATOM 59 CA LEU A 10 54.345 -6.167 125.164 1.00 40.21 
ATOM 60 CB LEU A 10 55.402 -7.009 125.881 1.00 42.40 
ATOM 61 CG LEU A 10 56.482 -6.282 126.687 1.00 42.29 
ATOM 62 CD1 LEU A 10 55.870 -5.293 127.647 1.00 42.92 
ATOM 63 CD2 LEU A 10 57.319 -7.306 127.424 1.00 40.29 
ATOM 64 C LEU A 10 53.591 -7.039 124.159 1.00 41.70 
ATOM 65 O LEU A 10 54.055 -7.266 123.044 1.00 37.13 
ATOM 66 N ASP A 11 52.419 -7.519 124.557 1.00 47.28 
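
As a rough illustration of how coordinate records of this kind can be used, the sketch below parses two of the records above (the alpha carbons of ALA 2 and LYS 3) and computes the distance between them. It assumes whitespace-separated fields in the order record type, atom number, atom name, residue, chain, residue number, X, Y, Z, occupancy, B. The class and function names are hypothetical.

```python
import math
from typing import NamedTuple


class Atom(NamedTuple):
    serial: int
    name: str
    residue: str
    chain: str
    resnum: int
    x: float
    y: float
    z: float


def parse_atom(line: str) -> Atom:
    """Parse one whitespace-separated ATOM record (assumed field order)."""
    f = line.split()
    return Atom(int(f[1]), f[2], f[3], f[4], int(f[5]),
                float(f[6]), float(f[7]), float(f[8]))


def distance(a: Atom, b: Atom) -> float:
    """Euclidean distance between two atoms, in the units of the coordinates."""
    return math.dist((a.x, a.y, a.z), (b.x, b.y, b.z))


ca_ala2 = parse_atom("ATOM 5 CA ALA A 2 46.801 13.697 139.052 1.00 59.41")
ca_lys3 = parse_atom("ATOM 7 CA LYS A 3 48.937 10.591 138.492 1.00 53.62")
print(round(distance(ca_ala2, ca_lys3), 2))  # about 3.81
```

A value near 3.8 is typical for the alpha carbons of adjacent residues in a protein chain, so the result doubles as a quick consistency check on the transcribed coordinates.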
Figure 18-2 
Figure 18-3 
Figure 18-4 
Figure 18-6 
Figure 18-7 
Figure 18-48 
Figure 18-54 
Figure 18-85 

68.700 20.144 67.057 1.00 57.24 
67.936 19.062 66.280 1.00 55.24 
66.738 19.588 65.558 1.00 55.31 
65.662 18.971 69.098 1.00 43.44 
65.211 18.736 67.978 1.00 43.03 
64.947 19.512 70.076 1.00 39.45 
63.563 19.885 69.875 1.00 40.31 
62.846 20.034 71.215 1.00 40.88 
62.943 18.901 72.234 1.00 40.09 
62.001 19.175 73.388 1.00 38.17 
62.588 17.596 71.580 1.00 41.56 
63.615 21.244 69.197 1.00 41.23 
64.466 22.070 69.531 1.00 39.22 
62.735 21.473 68.233 1.00 40.04 
62.703 22.771 67.582 1.00 43.32 
61.985 22.707 66.234 1.00 41.53 
60.617 22.085 66.335 1.00 41.89 
59.889 22.308 67.304 1.00 39.79 
60.243 21.317 65.317 1.00 40.43 
61.949 23.690 68.532 1.00 44.76 
61.402 23.237 69.539 1.00 45.80 
61.902 24.973 68.210 1.00 46.85 
61.234 25.930 69.076 1.00 47.60 
61.460 27.348 68.549 1.00 50.87 
61.089 28.407 69.552 1.00 55.06 
59.925 28.565 69.919 1.00 60.68 
62.091 29.121 70.048 1.00 59.17 
59.740 25.664 69.249 1.00 43.97 
59 1^0 25.898 70.322 1.00 41.33 
59.087 25.168 68.201 1.00 43.49 
57.655 24.892 68.264 1.00 45.95 
57.112 24.415 66.909 1.00 48.97 
57.212 25.400 65.731 1.00 53.41 
58.582 25.386 65.024 1.00 58.77 
59.700 26.013 65.846 1.00 58.10 
61.024 25.906 65.178 1.00 53.38 
57.368 23.822 69.309 1.00 45.79 
56.375 23.891 70.034 1.00 43.91 
58.245 22.829 69.381 1.00 44.28 
58.078 21.746 70.336 1.00 44.25 
59.013 20.589 69.986 1.00 41.44 
58.342 22.223 71.757 1.00 40.92 
57.639 21.S43 72.688 1.00 39.02 
59.352 23.085 71.922 1.00 38.14 
59.689 23.603 73.246 1.00 40.11 
60.892 24.552 73. 1^8 1.00 42.36 
62.174 23.922 72.:59 1.00 45.78 
63.325 24.926 72.e"5 1.00 48.46 
64.594 24.367 72.031 1.00 49.62 
65.108 23.139 72.700 1.00 48.53 
58.500 24.338 73.841 1.00 39.17 
58.132 24.112 74.994 1.00 38.87 
57.898 25.215 73.048 1.00 41.06 
56.750 25.986 73.512 1.00 42.35 
56.357 27.028 72.463 1.00 44.02 
57.434 28.084 72.258 1.00 44.80 
=7 835 28.742 73.569 1.00 48.40 
56.949 29.317 74.237 1.00 51.20 
59.029 28.680 73.935 1.00 47.81 
55.569 25.087 73.839 1.00 38.67 
-4 794 25.377 74.750 1.00 41.20 
=5 429 23.999 73.090 1.00 35.31 
54.349 23.056 73.334 1.00 32.69 
54.404 21.900 72.334 1.00 35.06 
53.344 20.813 72.544 1.00 35.01 
51.958 21.430 72.419 1.00 36.90 
53.521 19.699 71.525 1.00 32.36 
54.504 22.507 74.747 1.00 34.07 
53.621 22.664 75.583 1.00 30.53 
55.640 21.873 75.013 1.00 32.74 
55.889 21.311 76.330 1.00 34.99 
57.267 20.642 76.382 1.00 37.01 
57.466 19.428 75.470 1.00 34.91 
58.832 18.817 75.728 1.00 34.69 
56.369 18.396 75.742 1.00 34.10 
^5 789 22.363 77.429 1.00 37.12 
55.210 22.110 78.482 1.00 34.19 
56.353 23.540 77.186 1.00 34.34 
56.313 24.604 78.181 1.00 43.35 
57.162 25.788 77.712 1.00 46.25 
58.658 25.496 77.685 1.00 51.07 
59.482 26.610 77.021 1.00 49.96 
59.371 27.957 77.733 1.00 53.08 
58.013 28.569 77.662 1.00 56.18 
54.892 25.069 78.494 1.00 42.06 
54.588 25.416 79.631 1.00 43.05 
54.018 25.056 77.492 1.00 44.54 
52.639 25.502 77.679 1.00 46.58 
51.975 25.751 76.329 1.00 48.75 
51.769 24.527 75.646 1.00 49.55 
51.780 24.507 78.451 1.00 49.56 
50.618 24.791 78.749 1.00 46.67 
52.341 23.345 78.770 1.00 50.55 
51.586 22.326 79.488 1.00 51.93 
52.259 20.945 79.376 1.00 51.82 
51.447 19.902 80.134 1.00 50.29 
52.359 20.539 77.905 1.00 52.18 
53.044 19.210 77.693 1.00 55.42 
51.367 22.634 80.964 1.00 51.45 
*2 180 23.290 81.614 1.00 50.96 
50.245 22.141 81.472 1.00 54.05 
49.850 22.306 82.865 1.00 58.15 
48.320 22.216 82.959 1.00 60.38 
47.751 20.972 82.262 1.00 63.85 
48.017 19.833 82.710 1.00 59.16 
47.033 21.138 81.252 1.00 59.71 
50.506 21.207 83.701 1.00 55.47 
49.833 20.291 84.171 1.00 54.08 
51.816 21.307 83.906 1.00 54.60 
52.524 20.266 84.641 1.00 56.60 
53.718 19.784 83.811 1.00 53.01 
54.522 18.717 84.482 1.00 49.30 
53.898 17.589 85.008 1.00 45.61 
55.901 18.843 84.605 1.00 46.83 
54.637 16.600 85.651 1.00 45.95 
56.651 17.860 85.247 1.00 46.02 
56.018 16.737 85.772 1.00 46.08 
52.971 20.559 86.072 1.00 57.29 
52.197 20.378 87.012 1.00 63.54 
54.223 20.983 86.229 1.00 55.21 
54.818 21.286 87.535 1.00 60.30 
53.783 21.846 88.517 1.00 64.95 
54.375 22.225 89.867 1.00 71.50 
53.363 22.882 90.787 1.00 75.37 
52.796 23.925 90.394 1.00 75.32 
53.137 22.361 91.901 1.00 76.84 
55.485 20.058 88.146 1.00 55.66 
54.823 19.093 88.529 1.00 49.97 
^6.807 20.125 88.240 1.00 54.26 
57.630 19.047 88.767 1.00 54.35 
59.101 19.457 88.635 1.00 54.08 
60.074 18.315 88.514 1.00 54.15 
.256 
.435 
.225 
.805 
.051 
.976 
.309 
.878 
.379 
.852 
.402 
.692 
.759 
.306 
.248 
.021 
1.00 27.89 
1.00 27.23 
1.00 16.24 
1.00 27.18 
1.00 25.98 
1.00 24.59 
1.00 25.75 
1.00 26.53 
1.00 28.36 
1.00 25.27 
1.00 24.18 
1.00 27.20 
1.00 30.21 
1.00 27.18 
1.00 23.47 
1.00 28.28 
1.00 28.83 
1.00 26.16 
1.00 27.37 
1.00 25.36 

1.00 12.93 
1.00 7.53 
1.00 12.33 
1.00 14.54 
1.00 11.14 
1.00 22.41 
1.00 21.99 
1.00 26.67 
1.00 23.94 
1.00 26.96 
1.00 22.51 
1.00 32.15 
1.00 19.47 
1.00 18.81 
1.00 26.11 
1.00 24.28 
1.00 18.30 
1.00 22.27 
1.00 24.68 
1.00 30.00 
1.00 17.32 
1.00 27.38 
1.00 21.73 
1.00 34.17 
1.00 40.67 
1.00 27.92 
1.00 24.31 
1.00 30.93 
1.00 29.34 
1.00 22.61 
1.00 29.85 
1.00 36.40 
1.00 18.70 
1.00 30.02 
1.00 28.03 
1.00 26.46 
1.00 22.31 
1.00 30.83 
1.00 30.98 
1.00 38.13 
1.00 29.24 
1.00 46.34 
1.00 23.11 
1.00 27.08 
1.00 42.42 
1.00 38.97 
HETATM 6073 OH2 WAT G 47 38.458 -0.443 63.035 1.00 28.35 
HETATM 6074 OH2 WAT G 48 64.786 7.930 107.466 1.00 34.46 
HETATM 6075 OH2 WAT G 49 50.823 36.521 114.809 1.00 40.51 
HETATM 6076 OH2 WAT G 50 33.963 -10.352 68.080 1.00 39.11 
HETATM 6077 OH2 WAT G 51 71.328 -14.321 86.007 1.00 33.30 
HETATM 6078 OH2 WAT G 52 63.272 10.210 79.836 1.00 35.75 
HETATM 6079 OH2 WAT G 53 59.263 -12.096 94.306 1.00 29.57 
HETATM 6080 OH2 WAT G 54 46.041 10.641 76.561 1.00 27.97 
HETATM 6081 OH2 WAT G 55 46.614 -13.620 89.775 1.00 24.25 
HETATM 6082 OH2 WAT G 56 76.600 0.622 89.097 1.00 29.19 
HETATM 6083 OH2 WAT G 57 53.555 6.439 79.089 1.00 34.05 
HETATM 6084 OH2 WAT G 58 71.301 11.026 83.310 1.00 35.02 
HETATM 6085 OH2 WAT G 59 28.188 -9.956 81.594 1.00 33.21 
HETATM 6086 OH2 WAT G 60 53.084 20.992 98.483 1.00 27.64 
HETATM 6087 OH2 WAT G 61 59.484 8.630 93.423 1.00 30.30 
HETATM 6088 OH2 WAT G 62 26.195 -3.809 95.805 1.00 33.04 
HETATM 6089 OH2 WAT G 63 26.095 -0.121 89.620 1.00 37.39 
HETATM 6090 OH2 WAT G 64 47.100 -6.141 109.711 1.00 20.88 
HETATM 6091 OH2 WAT G 65 23.273 0.731 92.275 1.00 30.38 
HETATM 6092 OH2 WAT G 66 45.340 -24.751 72.694 1.00 37.51 
HETATM 6093 OH2 WAT G 67 33.754 16.234 111.676 1.00 34.63 
HETATM 6094 OH2 WAT G 68 52.831 19.209 126.276 1.00 47.11 
HETATM 6095 OH2 WAT G 69 50.218 16.953 111.099 1.00 26.24 
HETATM 6096 OH2 WAT G 70 44.791 5.844 70.857 1.00 24.95 
HETATM 6097 OH2 WAT G 71 49.517 -18.731 82.921 1.00 29.48 
HETATM 6098 OH2 WAT G 72 76.379 10.131 116.550 1.00 48.70 
HETATM 6099 OH2 WAT G 73 30.214 -8.086 87.873 1.00 46.35 
HETATM 6100 OH2 WAT G 74 45.320 12.061 80.458 1.00 30.80 
HETATM 6101 OH2 WAT G 75 72.881 5.360 86.249 1.00 29.04 
HETATM 6102 OH2 WAT G 76 59.674 -23.046 87.252 1.00 41.96 
HETATM 6103 OH2 WAT G 77 40.619 7.921 100.345 1.00 26.45 
HETATM 6104 OH2 WAT G 78 41.666 -19.477 70.073 1.00 36.27 
HETATM 6105 OH2 WAT G 79 46.408 -6.539 92.717 1.00 25.78 
HETATM 6106 OH2 WAT G 80 35.743 -12.230 81.646 1.00 28.34 
HETATM 6107 OH2 WAT G 81 28.268 8.745 121.961 1.00 41.15 
HETATM 6108 OH2 WAT G 82 68.843 3.154 71.986 1.00 32.34 
HETATM 6109 OH2 WAT G 83 52.125 -11.158 85.150 1.00 24.14 
HETATM 6110 OH2 WAT G 84 75.374 -1.773 92.264 1.00 26.12 
HETATM 6111 OH2 WAT G 85 46.957 12.230 142.271 1.00 37.07 
HETATM 6112 OH2 WAT G 86 63.789 9.551 64.329 1.00 55.58 
HETATM 6113 OH2 WAT G 87 60.672 21.185 72.215 1.00 58.55 
HETATM 6114 OH2 WAT G 88 56.547 9.505 82.064 1.00 31.10 
HETATM 6115 OH2 WAT G 89 26.366 -0.876 92.250 1.00 29.70 
HETATM 6116 OH2 WAT G 90 67.604 -16.583 80.808 1.00 32.85 
HETATM 6117 OH2 WAT G 91 23.910 1.899 82.068 1.00 42.95 
HETATM 6118 OH2 WAT G 92 50.032 4.106 117.380 1.00 30.05 
HETATM 6119 OH2 WAT G 93 26.774 -9.492 83.952 1.00 43.59 
HETATM 6120 OH2 WAT G 94 42.714 -0.637 113.787 1.00 40.17 
HETATM 6121 OH2 WAT G 95 57.966 7.989 134.170 1.00 47.82 
HETATM 6122 OH2 WAT G 96 54.478 -3.550 119.086 1.00 36.62 
HETATM 6123 OH2 WAT G 97 53.065 11.696 101.718 1.00 41.62 
HETATM 6124 OH2 WAT G 98 58.286 -23.645 68.207 1.00 45.98 
HETATM 6125 OH2 WAT G 99 54.855 -9.614 121.975 1.00 34.57 
HETATM 6126 OH2 WAT G 100 57.408 -3.352 57.145 1.00 42.14 
HETATM 6127 OH2 WAT G 101 63 -90 20.353 123.667 1.00 33.87 
HETATM 6128 OH2 WAT G 102 48.129 -23.143 72.392 1.00 30.23 
HETATM 6129 OH2 WAT G 103 62.334 6.913 76.094 1.00 52.01 
HETATM 6130 OH2 WAT G 104 34.566 6.529 73.089 1.00 36.29 
HETATM 6131 OH2 WAT G 105 51.588 20.869 67.459 1.00 36.85 
HETATM 6132 OH2 WAT G 106 ^8*160 18.020 129.379 1.00 42.87 
HETATM 6133 OH2 WAT G 107 49.082 -11.452 57.603 1.00 43.62 
HETATM 6134 OH2 WAT G 108 44*7"!7 -8.605 93.281 1.00 41.95 
HETATM 6135 OH2 WAT G 109 67.088 -11.900 94.019 1.00 35.71 
HETATM 6136 OH2 WAT G 110 49.561 22.763 100.800 1.00 35.40 
HETATM 6137 OH2 WAT G 111 75.853 10.960 124.536 1.00 56.20 
HETATM 6138 OH2 WAT G 112 54.383 8.930 136.095 1.00 36.40 
HETATM 6139 OH2 WAT G 113 
HETATM 6140 OH2 WAT G 114 
HETATM 6141 OH2 WAT G 115 
HETATM 6142 OH2 WAT G 116 
HETATM 6143 OH2 WAT G 117 
HETATM 6144 OH2 WAT G 118 
HETATM 6145 OH2 WAT G 119 
HETATM 6146 OH2 WAT G 120 
HETATM 6147 OH2 WAT G 121 
HETATM 6148 OH2 WAT G 122 
HETATM 6149 OH2 WAT G 123 
HETATM 6150 OH2 WAT G 124 
HETATM 6151 OH2 WAT G 125 
HETATM 6152 OH2 WAT G 126 
HETATM 6153 OH2 WAT G 127 
HETATM 6154 OH2 WAT G 128 
HETATM 6155 OH2 WAT G 129 
HETATM 6156 OH2 WAT G 130 
HETATM 6157 OH2 WAT G 131 
HETATM 6158 OH2 WAT G 132 
HETATM 6159 OH2 WAT G 133 
HETATM 6160 OH2 WAT G 134 
HETATM 6161 OH2 WAT G 135 
HETATM 6162 OH2 WAT G 136 
HETATM 6163 OH2 WAT G 137 
HETATM 6164 OH2 WAT G 138 
HETATM 6165 OH2 WAT G 139 
HETATM 6166 OH2 WAT G 140 
HETATM 6167 OH2 WAT G 141 
HETATM 6168 OH2 WAT G 143 
HETATM 6169 OH2 WAT G 144 
HETATM 6170 OH2 WAT G 145 
HETATM 6171 OH2 WAT G 146 
HETATM 6172 OH2 WAT G 147 
HETATM 6173 OH2 WAT G 148 
HETATM 6174 OH2 WAT G 149 
HETATM 6175 OH2 WAT G 150 
HETATM 6176 OH2 WAT G 151 
HETATM 6177 OH2 WAT G 152 
HETATM 6178 OH2 WAT G 153 
HETATM 6179 OH2 WAT G 154 
HETATM 6180 OH2 WAT G 155 
HETATM 6181 OH2 WAT G 156 
HETATM 6182 OH2 WAT G 157 
HETATM 6183 OH2 WAT G 158 
HETATM 6184 OH2 WAT G 159 
HETATM 6185 OH2 WAT G 160 
HETATM 6186 OH2 WAT G 161 
HETATM 6187 OH2 WAT G 162 
HETATM 6188 OH2 WAT G 163 
HETATM 6189 OH2 WAT G 164 
HETATM 6190 OH2 WAT G 165 
HETATM 6191 OH2 WAT G 166 
HETATM 6192 OH2 WAT G 167 
HETATM 6193 OH2 WAT G 168 
HETATM 6194 OH2 WAT G 169 
HETATM 6195 OH2 WAT G 170 
HETATM 6196 OH2 WAT G 171 
HETATM 6197 OH2 WAT G 172 
HETATM 6198 OH2 WAT G 173 
HETATM 6199 OH2 WAT G 174 
HETATM 6200 OH2 WAT G 175 
HETATM 6201 OH2 WAT G 176 
HETATM 6202 OH2 WAT G 177 
HETATM 6203 OH2 WAT G 178 
HETATM 6204 OH2 WAT G 179 
33.114 
42.618 
53.605 
73.410 
68.899 
31.474 
39.749 
44.921 
31.081 



1.764 67.443 
-4.357 102.345 



-10.816 
-1.010 



66.281 
90.400 



3.789 110.221 
19.159 112.425 



.616 132.457 
.089 137.137 
.617 75.105 



35.554 12.017 105.965 
41.381 -23.534 70.872 
31.999 1.992 73.813 
55.761 10.285 101.654 
30.596 12.964 133.642 
59.611 5.347 136.114 
24.190 12.220 124.679 
70.078 4.455 86.283 
-4.314 125.597 



1.00 37.01 
1.00 39.18 
1.00 31.62 
1.00 34.72 
1.00 35.69 
1.00 28.08 
1.00 37.43 
1.00 40.80 
1.00 40.86 
1.00 33.58 
1.00 38.10 
1.00 33.97 
1.00 47.66 
1.00 37.98 



57.882 
45.838 -20.690 
47.574 3.186 
46.856 -18.901 



65.884 
79.027 
62.295 
95.358 



40.164 5.047 
27.268 -0.405 122.461 
54.200 -20.155 66.212 
45.435 -10.534 103.626 
25.030 106.499 
-7.518 



31.633 
79.029 
68.597 
64.263 
49.387 
23.383 
42.360 
34.421 
31.506 
34.963 
54.859 
34.695 
40.348 
66.912 
31.096 
28.074 
63.586 
54.145 
62.443 
58.832 
37.701 
43.599 
23.540 
59.915 
51.265 
58.109 
46.553 



93.606 
20.711 111.685 
8.524 113.832 



-24.485 
-3.854 
-0.710 
-3.304 
3.-409 
10.688 

-15.085 



70.152 
83.604 
61.686 
65.685 
89.579 
91.806 
96.769 



55.706 -21.025 
67.146 
47.445 
65.193 
36.176 
70.527 
67.166 
19.700 
55.375 
61.874 
36.771 
63.224 
29.606 
52.811 



2.391 131.273 
1.395 61.905 
17.666 127.489 
19.900 103.232 
-4.222 70.175 
-1.894 99.003 
-22.222 88.415 
13.765 89.547 
9.798 101.311 
-5.528 119.322 
13.442 131.274 
-1.137 96.111 
-4.318 110.873 
-8.264 60.546 
7.024 98.294 
18.195 74.179 
92.515 
-1.958 109.704 
-3.047 134.746 
5.304 63.562 
8.979 102.024 
70.886 



5.797 
8.735 
9.630 
11.277 
8.432 



46.39 
30.77 
00 36.11 
00 41.40 
00 35.98 
00 36.67 
00 45.40 
00 31.38 
00 38.16 



.00 
.00 



1.00 37.55 
1.00 37.96 
1.00 43.94 
1.00 40.55 
1.00 33.25 
1.00 40.63 
1.00 34.07 
1.00 32.22 
1.00 35.94 
1.00 35.42 
1.00 39.86 
1.00 31.12 
1.00 46.65 
1.00 39.22 
1.00 34.09 
1.00 45.19 
1.00 43.45 
1.00 28.86 
1.00 41.15 
1.00 40.92 
1.00 33.69 



00 31.00 
00 45.00 
00 38.43 
00 51.83 
.00 41.92 
.00 31.25 
.00 46.30 
.00 37.53 



74.628 
81.850 
87.176 
91.682 



-6.815 121.530 
7.776 89.317 
15.345 132.470 
11.799 98.957 



1 
1 
1 
1 
1 
1 
1 
1 

1.00 43.91 
1.00 43.13 
1.00 27.99 
1.00 36.05 
1.00 39.63 
1.00 44.69 
1.00 51.41 
1.00 53.49 
1.00 38.63 
1.00 40.08 
1.00 32.57 
1.00 29.83 
1.00 47.28 
1.00 36.09 
HETATM 6205 OH2 WAT G 180 38.589 18.249 88.356 1.00 36.19 
HETATM 6206 OH2 WAT G 181 43.734 -15.681 61.135 1.00 34.24 
HETATM 6207 OH2 WAT G 182 42.283 15.251 91.437 1.00 37.96 
HETATM 6208 OH2 WAT G 183 57.121 -11.129 126.206 1.00 45.78 
HETATM 6209 OH2 WAT G 184 50.011 -19.367 92.127 1.00 36.55 
HETATM 6210 OH2 WAT G 185 56.580 2.453 95.959 1.00 39.12 
HETATM 6211 OH2 WAT G 186 26.356 14.125 125.052 1.00 32.68 
HETATM 6212 OH2 WAT G 187 24.631 20.230 122.650 1.00 45.67 
HETATM 6213 OH2 WAT G 188 23.516 4.964 81.599 1.00 42.16 
HETATM 6214 OH2 WAT G 189 55.017 14.964 62.948 1.00 50.18 
HETATM 6215 OH2 WAT G 190 33.371 13.710 105.640 1.00 37.04 
HETATM 6216 OH2 WAT G 191 44.466 -10.386 91.144 1.00 36.62 
HETATM 6217 OH2 WAT G 192 28.437 22.668 121.285 1.00 38.19 
HETATM 6218 OH2 WAT G 193 29.786 24.957 122.112 1.00 42.05 
HETATM 6219 OH2 WAT G 194 28.852 3.461 96.101 1.00 48.35 
HETATM 6220 OH2 WAT G 195 41.681 11.318 92.011 1.00 32.60 
HETATM 6221 OH2 WAT G 196 26.812 -10.229 111.631 1.00 47.70 
HETATM 6222 OH2 WAT G 197 42.432 -23.250 76.629 1.00 48.86 
HETATM 6223 OH2 WAT G 198 25.484 12.756 121.410 1.00 43.09 
HETATM 6224 OH2 WAT G 199 43.514 -20.514 111.706 1.00 46.80 
HETATM 6225 OH2 WAT G 200 74.273 -13.079 95.699 1.00 44.89 
HETATM 6226 OH2 WAT G 201 59.982 24.381 103.984 1.00 40.63 
HETATM 6227 OH2 WAT G 202 67.164 -12.771 74.705 1.00 35.13 
HETATM 6228 OH2 WAT G 203 28.708 9.211 79.238 1.00 33.53 
HETATM 6229 OH2 WAT G 204 53.256 -3.576 122.243 1.00 48.49 
HETATM 6230 OH2 WAT G 205 50.706 16.208 87.357 1.00 41.26 
HETATM 6231 OH2 WAT G 206 50.000 34.998 98.339 1.00 39.15 
HETATM 6232 OH2 WAT G 207 68.078 -16.236 83.621 1.00 29.70 
HETATM 6233 OH2 WAT G 208 24.395 ^4.134 111.635 1.00 50.82 
HETATM 6234 OH2 WAT G 209 53.384 -2.664 114.289 1.00 44.49 
HETATM 6235 OH2 WAT G 210 60.120 -9.482 94.788 1.00 31.97 
HETATM 6236 OH2 WAT G 211 23.405 17.472 111.744 1.00 44.97 
HETATM 6237 OH2 WAT G 212 46.214 20.943 76.878 1.00 59.14 
HETATM 6238 OH2 WAT G 213 29.754 6.983 97.109 1.00 41.78 
HETATM 6239 OH2 WAT G 214 46.820 -0.465 55.181 1.00 41.02 
HETATM 6240 OH2 WAT G 215 59.143 22.096 124.775 1.00 38.42 
HETATM 6241 OH2 WAT G 216 42.674 14.088 66.037 1.00 32.50 
HETATM 6242 OH2 WAT G 217 55.009 -7.248 98.186 1.00 56.50 
HETATM 6243 OH2 WAT G 218 63.361 -8.209 109.653 1.00 49.66 
HETATM 6244 OH2 WAT G 219 66.583 -8.146 94.571 1.00 50.91 
HETATM 6245 OH2 WAT G 220 44.627 -2.583 93.919 1.00 36.99 
HETATM 6246 OH2 WAT G 221 24.470 -8.606 79.502 1.00 47.24 
HETATM 6247 OH2 WAT G 222 76.913 -7.777 83.973 1.00 50.43 
HETATM 6248 OH2 WAT G 223 32.788 0.651 129.136 1.00 42.47 
HETATM 6249 OH2 WAT G 224 73.731 -16.880 88.817 1.00 46.69 
HETATM 6250 OH2 WAT G 225 78.567 -2.802 93.970 1.00 43.17 
HETATM 6251 OH2 WAT G 226 45.681 1.248 57.532 1.00 35.84 
HETATM 6252 OH2 WAT G 227 38.263 15.236 84.711 1.00 42.39 
HETATM 6253 OH2 WAT G 228 38.933 35.224 108.488 1.00 52.23 
HETATM 6254 OH2 WAT G 229 33.755 14.939 70.228 1.00 46.56 
HETATM 6255 OH2 WAT G 230 51.521 34.184 100.859 1.00 52.96 
HETATM 6256 OH2 WAT G 231 34.140 0.565 63.039 1.00 31.02 
HETATM 6257 OH2 WAT G 232 37.277 13.977 81.662 1.00 39.83 
HETATM 6258 OH2 WAT G 233 57.307 5.947 93.216 1.00 28.55 
HETATM 6259 OH2 WAT G 234 31.718 16.820 125.707 1.00 53.16 
HETATM 6260 OH2 WAT G 235 60.524 31.119 110.067 1.00 51.02 
HETATM 6261 OH2 WAT G 236 44 *>57 4.267 94.916 1.00 57.50 
HETATM 6262 OH2 WAT G 237 68.454 1.072 70.111 1.00 38.65 
HETATM 6263 OH2 WAT G 238 27.836 6.773 79.253 1.00 44.95 
HETATM 6264 OH2 WAT G 239 54.923 23.344 81.087 1.00 42.95 
HETATM 6265 OH2 WAT G 240 34.072 -15.271 71.579 1.00 45.78 
HETATM 6266 OH2 WAT G 241 35.956 -1.059 61.973 1.00 38.91 
HETATM 6267 OH2 WAT G 242 29.587 1.898 127.376 1.00 44.85 
HETATM 6268 OH2 WAT G 243 49 -34 -10.150 113.501 1.00 38.32 
HETATM 6269 OH2 WAT G 244 57.252 9.773 96.696 1.00 48.83 
HETATM 6270 OH2 WAT G 245 62.310 13.262 80.972 1.00 38.54 
75.703 
32.019 
54.081 
32.801 
45.040 
39.815 
28.763 
49.668 -12 
64.353 
75.183 
46.239 
68.708 
71.504 
36.309 
65.973 
71.952 



-5.552 102.815 
21.564 79.321 



20.866 108.414 
19.763 61.067 



9.828 
-6.179 
12.667 
5.089 



67.749 
66.319 
85.010 
97.225 



3.406 122.119 
2.630 128.600 
12.973 113.965 



3 
-6 
0 



421 
170 

,301 



56.994 
91.078 
95.449 






1.00 43.23 
1.00 36.79 
HETATM 6337 OH2 WAT G 312 
HETATM 6338 OH2 WAT G 313 
HETATM 6339 OH2 WAT G 314 
HETATM 6340 OH2 WAT G 315 
HETATM 6341 OH2 WAT G 316 
HETATM 6342 OH2 WAT G 317 
HETATM 6343 OH2 WAT G 318 
HETATM 6344 OH2 WAT G 319 
HETATM 6345 OH2 WAT G 320 
HETATM 6346 OH2 WAT G 321 
HETATM 6347 OH2 WAT G 322 
HETATM 6348 OH2 WAT G 323 
HETATM 6349 OH2 WAT G 324 
HETATM 6350 OH2 WAT G 325 
HETATM 6351 OH2 WAT G 326 
HETATM 6352 OH2 WAT G 327 
HETATM 6353 OH2 WAT G 328 
HETATM 6354 OH2 WAT G 329 
HETATM 6355 OH2 WAT G 330 
HETATM 6356 OH2 WAT G 331 
HETATM 6357 OH2 WAT G 332 
HETATM 6358 OH2 WAT G 333 
HETATM 6359 OH2 WAT G 334 
HETATM 6360 OH2 WAT G 335 
HETATM 6361 OH2 WAT G 336 
HETATM 6362 OH2 WAT G 337 
HETATM 6363 OH2 WAT G 338 
HETATM 6364 OH2 WAT G 339 
HETATM 6365 OH2 WAT G 340 
HETATM 6366 OH2 WAT G 341 
HETATM 6367 OH2 WAT G 342 
HETATM 6368 OH2 WAT G 343 
HETATM 6369 OH2 WAT G 344 
HETATM 6370 OH2 WAT G 345 
HETATM 6371 OH2 WAT G 346 
HETATM 6372 OH2 WAT G 347 
HETATM 6373 OH2 WAT G 348 
HETATM 6374 OH2 WAT G 349 
HETATM 6375 OH2 WAT G 350 
HETATM 6376 OH2 WAT G 351 
HETATM 6377 OH2 WAT G 352 
HETATM 6378 OH2 WAT G 353 
HETATM 6379 OH2 WAT G 354 
HETATM 6380 OH2 WAT G 355 
HETATM 6381 OH2 WAT G 356 
HETATM 6382 OH2 WAT G 357 
HETATM 6383 OH2 WAT G 358 
HETATM 6384 OH2 WAT G 359 
HETATM 6385 OH2 WAT G 360 
HETATM 6386 OH2 WAT G 361 
HETATM 6387 OH2 WAT G 362 
HETATM 6388 OH2 WAT G 363 
HETATM 6389 OH2 WAT G 364 
HETATM 6390 OH2 WAT G 365 
HETATM 6391 OH2 WAT G 366 
HETATM 6392 OH2 WAT G 367 
HETATM 6393 OH2 WAT G 368 
HETATM 6394 OH2 WAT G 369 
HETATM 6395 OH2 WAT G 370 
HETATM 6396 OH2 WAT G 371 
HETATM 6397 OH2 WAT G 372 
HETATM 6398 OH2 WAT G 373 
HETATM 6399 OH2 WAT G 374 
HETATM 6400 OH2 WAT G 375 
HETATM 6401 OH2 WAT G 376 
HETATM 6402 OH2 WAT G 377 
HETATM 6403 OH2 WAT G 378 55.780 -14.986 65.487 1.00 49.09 
HETATM 6404 OH2 WAT G 379 40.990 21.205 91.611 1.00 41.02 
HETATM 6405 OH2 WAT G 380 48.157 1.057 116.992 1.00 44.84 
HETATM 6406 OH2 WAT G 381 37.954 -6.221 128.334 1.00 37.09 
HETATM 6407 OH2 WAT G 382 30.221 27.743 109.194 1.00 39.92 
HETATM 6408 OH2 WAT G 383 49.926 -12.826 118.421 1.00 58.95 
HETATM 6409 OH2 WAT G 384 42.435 -17.636 81.477 1.00 48.47 
HETATM 6410 OH2 WAT G 385 58.226 -25.990 71.378 1.00 48.18 
HETATM 6411 OH2 WAT G 386 40.495 17.944 128.741 1.00 43.82 
HETATM 6412 OH2 WAT G 387 31.943 6.301 109.475 1.00 35.53 
HETATM 6413 OH2 WAT G 388 47.277 2.559 100.509 1.00 43.00 
HETATM 6414 OH2 WAT G 389 38.862 9.112 102.620 1.00 31.70 
HETATM 6415 OH2 WAT G 390 71.652 14.568 105.167 1.00 49.63 
HETATM 6416 OH2 WAT G 391 68.554 -10.518 73.331 1.00 38.16 
HETATM 6417 OH2 WAT G 392 70.496 -16.160 84.425 1.00 32.16 
HETATM 6418 OH2 WAT G 393 44.698 -24.950 75.603 1.00 43.38 
HETATM 6419 OH2 WAT G 394 56.172 15.369 55.027 1.00 47.44 
HETATM 6420 OH2 WAT G 395 46.150 -9.441 99.999 1.00 47.98 
HETATM 6421 OH2 WAT G 396 26.892 -8.356 89.057 1.00 34.99 
HETATM 6422 OH2 WAT G 397 31.737 14.380 90.395 1.00 50.78 
HETATM 6423 OH2 WAT G 398 36.261 -13.824 62.777 1.00 50.86 
HETATM 6424 OH2 WAT G 399 37.312 15.242 134.977 1.00 43.57 
HETATM 6425 OH2 WAT G 400 33.728 13.773 126. 41* 1.00 57.13 
HETATM 6426 OH2 WAT G 401 45.269 27.937 130.311 1.00 49.55 
HETATM 6427 OH2 WAT G 402 44.887 -17.414 111.508 1.00 54.29 
HETATM 6428 OH2 WAT G 403 68.928 0.455 136.711 1.00 49.90 
HETATM 6429 OH2 WAT G 404 43.271 -21.571 64.425 1.00 48.61 
HETATM 6430 OH2 WAT G 405 24.243 -4.781 108.590 1.00 51.05 
HETATM 6431 OH2 WAT G 406 54.828 5.311 59.009 1.00 43.43 
HETATM 6432 OH2 WAT G 407 53.460 27.992 124.076 1.00 47.83 
HETATM 6433 OH2 WAT G 408 70.833 -18.390 85.386 1.00 49.26 
HETATM 6434 OH2 WAT G 409 71.497 15.287 113.071 1.00 34.52 
HETATM 6435 OH2 WAT G 410 36.407 -18.480 110.466 1.00 55.43 
HETATM 6436 OH2 WAT G 411 26.220 -9.551 78.158 1.00 47.69 
HETATM 6437 OH2 WAT G 412 52.319 26.326 82.038 1.00 42.00 
HETATM 6438 OH2 WAT G 413 76.173 14.097 122.253 1.00 44.90 
HETATM 6439 OH2 WAT G 414 58.379 6.335 123.024 1.00 54.61 
HETATM 6440 OH2 WAT G 415 72.162 -16.705 82.719 1.00 50.63 
HETATM 6441 OH2 WAT G 416 63.557 26.152 65.944 1.00 39.83 
HETATM 6442 OH2 WAT G 417 38.935 23.070 122.742 1.00 52.57 
HETATM 6443 OH2 WAT G 418 55.256 -10.714 124.501 1.00 42.38 
HETATM 6444 OH2 WAT G 419 55.443 -9.037 110.170 1.00 46.47 
HETATM 6445 OH2 WAT G 420 73.873 16.578 123.288 1.00 46.54 
HETATM 6446 OH2 WAT G 421 74.426 12.663 117.527 1.00 43.62 
HETATM 6447 OH2 WAT G 422 52.374 -0.368 51.502 1.00 56.99 
HETATM 6448 OH2 WAT G 423 60.339 20.215 84.713 1.00 36.27 
HETATM 6449 OH2 WAT G 424 48.308 1.354 54.561 1.00 38.53 
HETATM 6450 OH2 WAT G 425 61.757 21.606 115.976 1.00 61.09 
HETATM 6451 OH2 WAT G 426 33.222 -14.916 119.528 1.00 51.12 
HETATM 6452 OH2 WAT G 427 47.477 3.359 112.298 1.00 46.10 
HETATM 6453 OH2 WAT G 428 39.909 2.272 138.388 1.00 35.33 
HETATM 6454 OH2 WAT G 429 57.829 15.336 126.262 1.00 62.59 
HETATM 6455 OH2 WAT G 430 48.917 -5.857 119.191 1.00 51.45 
HETATM 6456 OH2 WAT G 431 44.139 -3.812 132.964 1.00 44.91 
HETATM 6457 OH2 WAT G 432 38.885 18.594 95.398 1.00 50.23 
HETATM 6458 OH2 WAT G 433 ^2.628 -7.064 55.271 1.00 38.96 
HETATM 6459 OH2 WAT G 434 60.644 -0.731 101.129 1.00 47.30 
HETATM 6460 OH2 WAT G 435 64.772 5.808 71.942 1.00 50.81 
HETATM 6461 OH2 WAT G 436 -9-571 16.705 80.180 1.00 34.07 
HETATM 6462 OH2 WAT G 437 32.791 -0.551 65.371 1.00 4...40 
HETATM 6463 OH2 WAT G 438 =3 -18 -7.989 60.087 1.00 46.94 
HETATM 6464 OH2 WAT G 439 ^6 982 5.474 120.408 1.00 46.28 
HETATM 6465 OH2 WAT G 440 72.138 1.233 90.050 1.00 50.13 
HETATM 6466 OH2 WAT G 441 29.494 10.971 118.393 1.00 56.30 
HETATM 6467 OH2 WAT G 442 69.232 5.594 113.941 1.00 58.17 
HETATM 6468 OH2 WAT G 443 61.459 11.576 71.140 1.00 61.67 
Figure 19-3 
Figure 19-5 



ATOM 265 O ARG A 33 38.713 53.390 56.592 
ATOM 266 N PHE A 34 37.247 51.682 56.562 
ATOM 267 CA PHE A 34 36.292 52.233 57.517 
ATOM 268 CB PHE A 34 35.065 51.310 57.573 
ATOM 269 CG PHE A 34 33.925 51.840 58.405 
ATOM 270 CD1 PHE A 34 33.108 52.856 57.925 
ATOM 271 CD2 PHE A 34 33.668 51.315 59.672 
ATOM 272 CE1 PHE A 34 32.044 53.343 58.695 
ATOM 273 CE2 PHE A 34 32.607 51.797 60.454 
ATOM 274 CZ PHE A 34 31.794 52.809 59.966 
ATOM 275 C PHE A 34 36.881 52.414 58.918 
ATOM 276 O PHE A 34 36.903 53.524 59.455 
ATOM 277 N LYS A 35 37.350 51.324 59.516 
ATOM 278 CA LYS A 35 37.928 51.401 60.843 
ATOM 279 CB LYS A 35 38.230 50.010 61.362 
ATOM 280 CG LYS A 35 37.000 49.190 61.662 
ATOM 281 CD LYS A 35 37.414 47.810 62.106 
ATOM 282 CE LYS A 35 38.062 47.072 60.948 
ATOM 283 NZ LYS A 35 39.058 47.928 60.236 
ATOM 284 C LYS A 35 39.185 52.255 60.881 
ATOM 285 O LYS A 35 39.554 52.775 61.929 
ATOM 286 N ASP A 36 39.853 52.384 59.745 
ATOM 287 CA ASP A 36 41.034 53.216 59.680 
ATOM 288 CB ASP A 36 41.812 52.943 58.388 
ATOM 289 CG ASP A 36 42.964 53.908 58.186 
ATOM 290 OD1 ASP A 36 43.648 54.218 59.184 
ATOM 291 OD2 ASP A 36 43.201 54.341 57.035 
ATOM 292 C ASP A 36 40.568 54.670 59.724 
ATOM 293 O ASP A 36 41.231 55.527 60.306 
ATOM 294 N ALA A 37 39.420 54.940 59.111 
ATOM 295 CA ALA A 37 38.851 56.280 59.108 
ATOM 296 CB ALA A 37 37.751 56.373 58.067 
ATOM 297 C ALA A 37 38.291 56.617 60.499 
ATOM 298 O ALA A 37 38.268 57.779 60.899 
ATOM 299 N MET A 38 37.830 55.600 61.226 
ATOM 300 CA MET A 38 37.287 55.794 62.572 
ATOM 301 CB MET A 38 36.289 54.687 62.917 
ATOM 302 CG MET A 38 35.084 54.559 61.996 
ATOM 303 SD MET A 38 33.980 55.948 62.101 
ATOM 304 CE MET A 38 33.550 55.878 63.849 
ATOM 305 C MET A 38 38.430 55.724 63.583 
ATOM 306 O MET A 38 38.226 55.930 64.777 
ATOM 307 N ASN A 39 39.628 55.428 63.090 
ATOM 308 CA ASN A 39 40.805 55.266 63.935 
ATOM 309 CB ASN A 39 41.200 56.600 64.589 
ATOM 310 CG ASN A 39 41.393 57.736 63.571 
ATOM 311 OD1 ASN A 39 42.180 57.624 62.630 
ATOM 312 ND2 ASN A 39 40.677 58.838 63.772 
ATOM 313 C ASN A 39 40.483 54.212 65.009 
ATOM 314 O ASN A 39 40.565 54.490 66.205 
ATOM 315 N LEU A 40 40.095 53.010 64.570 
ATOM 316 CA LEU A 40 39.750 51.898 65.474 
ATOM 317 CB LEU A 40 38.259 51.559 65.386 
ATOM 318 CG LEU A 40 37.231 52.581 65.879 
ATOM 319 CD1 LEU A 40 35.837 52.089 65.554 
ATOM 320 CD2 LEU A 40 37.372 52.798 67.375 
ATOM 321 C LEU A 40 40.555 50.628 65.187 
ATOM 322 O LEU A 40 40.196 49.530 65.618 
ATOM 323 N ILE A 41 41.652 50.794 64.464 
ATOM 324 CA ILE A 41 42.508 49.680 64.116 
ATOM 325 CB ILE A 41 42.017 43.991 62.811 
ATOM 326 CG2 ILE A 41 42.070 49.952 61.636 
ATOM 327 CG1 ILE A 41 42.898 47.790 62.480 
ATOM 328 CD1 ILE A 41 42.854 46.701 63.500 
ATOM 329 C ILE A 41 43.921 50.226 63.916 
ATOM 330 O ILE A 41 44.106 51.346 63.413 



.00 33.61 
.00 33.78 
.00 33.79 
.00 33.88 
.00 33.16 
32.77 



1. 
1. 
1. 
1. 

72 1.00 33.05 



.00 



.00 
.00 



32 .86 
33.07 



1.00 32.58 
1.00 34.01 
1.00 33.49 
1.00 34.00 
1.00 33.90 
1.00 34.07 
1.00 33.94 
1.00 35.31 
1.00 35.91 
1.00 36.19 
1.00 34.19 



00 34.32 
00 33.99 
00 35.17 
00 37.40 
00 38.64 
00 40.02 
00 38.74 
1.00 35.85 
1.00 36.88 
1.00 34.96 
1.00 34.47 
1.00 33.80 
1.00 34.66 
1.00 34.55 
00 34.24 
00 33.07 
00 32.82 
32 



00 

00 33. 



72 
65 



1.00 33.77 
1.00 33.12 
1.00 32.82 
,00 32.64 
.00 32.38 
.00 32.93 
.00 34.40 
.00 34.98 



1 
1 
1 
1 
1 

1.00 33.52 
1.00 31.69 
1 
1 
1 
1 
1 
i 



00 31.12 
00 31.76 
.00 32.48 
.00 32.55 
.00 32.84 
.00 33.79 
1.00 32.45 
1.00 32.92 
1.00 31.64 
1.00 34.12 
1.00 36.07 
1.00 35.51 
1.00 33.37 
1.00 35.97 
1.00 37.19 
1.00 38.85 
I. 00 38.98 



AAAA 
AAAA 
AAAA 
AAAA 
AAAA 
AAAA 
AAAA 
AAAA 
AAAA 
AAAA 
AAAA 
AAAA 
AAAA 
AAAA 
AAAA 
AAAA 
AAAA 
AAAA 
AAAA 

AAAA 

* 

nrtnn 

AAAA 
AAAA 
AAAA 
AAAA 
AAAA 
AAAA 
AAAA 
AAAA 
AAAA 
AAAA 
AAAA 
AAAA 
AAAA 
AAAA 
AAAA 
AAAA 
AAAA 
AAAA 

AAAA 

AAAA 

AAAA 

AAAA 

AAAA 

AAAA 

AAAA 

AAAA 

AAAA 

AAAA 

AAAA 

AAAA 

AAAA 

AAAA 

AAAA 

AAAA 

AAAA 

AAAA 

AAAA 

AAAA 

AAAA 

AAAA 

AAAA 

AAAA 

AAAA 

AAAA 



Figure 19-6 

ATOM 331 N ASP A 42 44.914 49.446 64.329 1.00 40.61 AAAA 
ATOM 332 CA ASP A 42 46.309 49.843 64.181 1.00 42.57 AAAA 
ATOM 333 CB ASP A 42 46.973 50.021 65.553 1.00 42.42 AAAA 
ATOM 334 CG ASP A 42 46.316 51.110 66.381 1.00 42.27 AAAA 
ATOM 335 OD1 ASP A 42 46.227 52.250 65.883 1.00 41.20 AAAA 
ATOM 336 OD2 ASP A 42 45.891 50.833 67.526 1.00 43.36 AAAA 
ATOM 337 C ASP A 42 47.011 48.752 63.392 1.00 44.05 AAAA 
ATOM 338 O ASP A 42 46.525 47.620 63.333 1.00 44.88 AAAA 
ATOM 339 N GLU A 43 48.147 49.090 62.789 1.00 45.10 AAAA 
ATOM 340 CA GLU A 43 48.905 48.141 61.980 1.00 46.11 AAAA 
ATOM 341 CB GLU A 43 50.172 48.796 61.454 1.00 46.89 AAAA 
ATOM 342 CG GLU A 43 49.924 50.057 60.658 1.00 49.30 AAAA 
ATOM 343 CD GLU A 43 51.187 50.580 60.028 1.00 49.67 AAAA 
ATOM 344 OE1 GLU A 43 51.760 49.839 59.201 1.00 50.60 AAAA 
ATOM 345 OE2 GLU A 43 51.601 51.714 60.349 1.00 49.60 AAAA 
ATOM 346 C GLU A 43 49.290 46.859 62.701 1.00 46.27 AAAA 
ATOM 347 O GLU A 43 49.214 45.773 62.131 1.00 46.00 AAAA 
ATOM 348 N LYS A 44 49.708 46.986 63.954 1.00 46.52 AAAA 
ATOM 349 CA LYS A 44 50.135 45.832 64.730 1.00 46.31 AAAA 
ATOM 350 CB LYS A 44 50.762 46.306 66.048 1.00 48.16 AAAA 
ATOM 351 CG LYS A 44 51.977 47.215 65.799 1.00 51.59 AAAA 
ATOM 352 CD LYS A 44 52.641 47.734 67.071 1.00 52.87 AAAA 
ATOM 353 CE LYS A 44 53.851 48.601 66.727 1.00 53.34 AAAA 
ATOM 354 NZ LYS A 44 54.615 49.033 67.936 1.00 53.45 AAAA 
ATOM 355 C LYS A 44 49.029 44.828 64.996 1.00 44.74 AAAA 
ATOM 356 O LYS A 44 49.296 43.735 65.480 1.00 45.35 AAAA 
ATOM 357 N GLU A 45 47.793 45.190 64.659 1.00 42.49 AAAA 
ATOM 358 CA GLU A 45 46.638 44.320 64.894 1.00 40.54 AAAA 
ATOM 359 CB GLU A 45 45.493 45.125 65.517 1.00 40.55 AAAA 
ATOM 360 CG GLU A 45 45.788 45.731 66.882 1.00 38.87 AAAA 
ATOM 361 CD GLU A 45 44.663 46.618 67.360 1.00 37.57 AAAA 
ATOM 362 OE1 GLU A 45 44.383 47.631 66.693 1.00 36.29 AAAA 
ATOM 363 OE2 GLU A 45 44.056 46.300 68.399 1.00 38.44 AAAA 
ATOM 364 C GLU A 45 46.126 43.648 63.630 1.00 39.15 AAAA 
ATOM 365 O GLU A 45 45.301 42.737 63.681 1.00 39.29 AAAA 
ATOM 366 N LEU A 46 46.619 44.115 62.497 1.00 37.62 AAAA 
ATOM 367 CA LEU A 46 46.219 43.589 61.211 1.00 35.88 AAAA 
ATOM 368 CB LEU A 46 46.125 44.750 60.229 1.00 36.09 AAAA 
ATOM 369 CG LEU A 46 45.608 44.550 58.817 1.00 36.50 AAAA 
ATOM 370 CD1 LEU A 46 44.182 44.021 58.843 1.00 36.66 AAAA 
ATOM 371 CD2 LEU A 46 45.646 45.893 58.113 1.00 35.85 AAAA 
ATOM 372 C LEU A 46 47.211 42.542 60.714 1.00 34.97 AAAA 
ATOM 373 O LEU A 46 48.424 42.670 60.900 1.00 35.72 AAAA 
ATOM 374 N ILE A 47 46.680 41.484 60.118 1.00 33.25 AAAA 
ATOM 375 CA ILE A 47 47.497 40.411 59.560 1.00 30.92 AAAA 
ATOM 376 CB ILE A 47 47.144 39.024 60.167 1.00 31.22 AAAA 
ATOM 377 CG2 ILE A 47 48.093 37.970 59.640 1.00 28.55 AAAA 
ATOM 378 CG1 ILE A 47 47.220 39.063 61.694 1.00 32.04 AAAA 
ATOM 379 CD1 ILE A 47 48.596 39.241 62.242 1.00 34.13 AAAA 
ATOM 380 C ILE A 47 47.138 40.381 58.076 1.00 29.70 AAAA 
ATOM 381 O ILE A 47 45.956 40.373 57.714 1.00 28.42 AAAA 
ATOM 382 N LYS A 48 48.150 40.380 57.221 1.00 28.78 AAAA 
ATOM 383 CA LYS A 48 47.920 40.349 55.784 1.00 28.42 AAAA 
ATOM 384 CB LYS A 48 49.203 40.727 55.055 1.00 27.53 AAAA 
ATOM 385 CG LYS A 48 49.116 40.695 53.556 1.00 28.97 AAAA 
ATOM 386 CD LYS A 48 50.464 41.104 52.941 1.00 29.67 AAAA 
ATOM 387 CE LYS A 48 50.493 40.893 51.432 1.00 29.41 AAAA 
ATOM 388 NZ LYS A 48 49.409 41.645 50.764 1.00 29.68 AAAA 
ATOM 389 C LYS A 48 47.449 38.950 55.375 1.00 27.81 AAAA 
ATOM 390 O LYS A 48 48.024 37.938 55.787 1.00 27.96 AAAA 
ATOM 391 N SER A 49 46.385 38.892 54.581 1.00 26.82 AAAA 
ATOM 392 CA SER A 49 45.854 37.611 54.141 1.00 26.41 AAAA 
ATOM 393 CB SER A 49 44.514 37.795 53.420 1.00 25.40 AAAA 
ATOM 394 OG SER A 49 43.541 38.349 54.276 1.00 25.58 AAAA 
ATOM 395 C SER A 49 46.814 36.891 53.207 1.00 26.03 AAAA 
ATOM 396 O SER A 49 47.462 37.513 52.373 1.00 26.98 AAAA 



Figure 19-7 

ATOM 397 N ARG A 50 46.910 35.576 53.354 1.00 25.51 AAAA 
ATOM 398 CA ARG A 50 47.755 34.794 52.474 1.00 25.45 AAAA 
ATOM 399 CB ARG A 50 48.807 33.985 53.252 1.00 25.85 AAAA 
ATOM 400 CG ARG A 50 48.229 32.819 54.009 1.00 27.16 AAAA 
ATOM 401 CD ARG A 50 49.280 31.995 54.720 1.00 27.57 AAAA 
ATOM 402 NE ARG A 50 48.673 30.896 55.482 1.00 27.90 AAAA 
ATOM 403 CZ ARG A 50 48.106 29.820 54.946 1.00 28.34 AAAA 
ATOM 404 NH1 ARG A 50 48.055 29.672 53.630 1.00 28.19 AAAA 
ATOM 405 NH2 ARG A 50 47.592 28.884 55.735 1.00 28.62 AAAA 
ATOM 406 C ARG A 50 46.806 33.834 51.762 1.00 24.91 AAAA 
ATOM 407 O ARG A 50 45.740 33.510 52.283 1.00 23.57 AAAA 
ATOM 408 N PRO A 51 47.172 33.392 50.549 1.00 24.28 AAAA 
ATOM 409 CD PRO A 51 48.361 33.761 49.770 1.00 24.13 AAAA 
ATOM 410 CA PRO A 51 46.355 32.462 49.776 1.00 24.18 AAAA 
ATOM 411 CB PRO A 51 47.012 32.512 48.390 1.00 24.24 AAAA 
ATOM 412 CG PRO A 51 47.766 33.862 48.405 1.00 24.11 AAAA 
ATOM 413 C PRO A 51 46.473 31.070 50.393 1.00 23.69 AAAA 
ATOM 414 O PRO A 51 47.545 30.680 50.839 1.00 24.13 AAAA 
ATOM 415 N ALA A 52 45.381 30.325 50.422 1.00 23.36 AAAA 
ATOM 416 CA ALA A 52 45.419 28.972 50.952 1.00 23.64 AAAA 
ATOM 417 CB ALA A 52 44.012 28.405 51.029 1.00 23.86 AAAA 
ATOM 418 C ALA A 52 46.260 28.145 49.994 1.00 23.58 AAAA 
ATOM 419 O ALA A 52 46.240 28.383 48.806 1.00 24.52 AAAA 
ATOM 420 N THR A 53 47.009 27.185 50.501 1.00 24.41 AAAA 
ATOM 421 CA THR A 53 47.815 26.352 49.628 1.00 26.26 AAAA 
ATOM 422 CB THR A 53 48.933 25.642 50.405 1.00 26.37 AAAA 
ATOM 423 OG1 THR A 53 48.355 24.763 51.375 1.00 26.51 AAAA 
ATOM 424 CG2 THR A 53 49.810 26.648 51.106 1.00 24.48 AAAA 
ATOM 425 C THR A 53 46.889 25.299 49.034 1.00 27.63 AAAA 
ATOM 426 O THR A 53 45.870 24.982 49.620 1.00 29.22 AAAA 
ATOM 427 N LYS A 54 47.240 24.776 47.867 1.00 29.31 AAAA 
ATOM 428 CA LYS A 54 46.450 23.752 47.189 1.00 30.61 AAAA 
ATOM 429 CB LYS A 54 47.249 23.182 46.015 1.00 31.68 AAAA 
ATOM 430 CG LYS A 54 46.585 22.020 45.304 1.00 34.38 AAAA 
ATOM 431 CD LYS A 54 45.449 22.464 44.417 1.00 36.00 AAAA 
ATOM 432 CE LYS A 54 45.943 22.850 43.025 1.00 37.55 AAAA 
ATOM 433 NZ LYS A 54 46.425 21.664 42.236 1.00 37.57 AAAA 
ATOM 434 C LYS A 54 46.127 22.640 48.170 1.00 31.26 AAAA 
ATOM 435 O LYS A 54 45.025 22.097 48.176 1.00 31.72 AAAA 
ATOM 436 N GLU A 55 47.102 22.312 49.006 1.00 31.88 AAAA 
ATOM 437 CA GLU A 55 46.961 21.260 50.011 1.00 32.29 AAAA 
ATOM 438 CB GLU A 55 48.266 21.089 50.778 1.00 34.43 AAAA 
ATOM 439 CG GLU A 55 48.265 19.901 51.706 1.00 38.39 AAAA 
ATOM 440 CD GLU A 55 49.513 19.839 52.584 1.00 41.46 AAAA 
ATOM 441 OE1 GLU A 55 49.745 18.770 53.200 1.00 43.30 AAAA 
ATOM 442 OE2 GLU A 55 50.245 20.859 52.672 1.00 42.45 AAAA 
ATOM 443 C GLU A 55 45.851 21.555 51.013 1.00 30.43 AAAA 
ATOM 444 O GLU A 55 45.048 20.681 51.332 1.00 30.59 AAAA 
ATOM 445 N GLU A 56 45.822 22.782 51.517 1.00 28.23 AAAA 
ATOM 446 CA GLU A 56 44.812 23.164 52.488 1.00 27.69 AAAA 
ATOM 447 CB GLU A 56 45.078 24.588 52.989 1.00 27.90 AAAA 
ATOM 448 CG GLU A 56 46.434 24.721 53.670 1.00 26.64 AAAA 
ATOM 449 CD GLU A 56 46.769 26.135 54.098 1.00 26.35 AAAA 
ATOM 450 OE1 GLU A 56 46.615 27.057 53.265 1.00 25.12 AAAA 
ATOM 451 OE2 GLU A 56 47.213 26.315 55.255 1.00 25.70 AAAA 
ATOM 452 C GLU A 56 43.408 23.043 51.914 1.00 26.99 AAAA 
ATOM 453 O GLU A 56 42.495 22.574 52.588 1.00 26.25 AAAA 
ATOM 454 N LEU A 57 43.252 23.447 50.659 1.00 27.26 AAAA 
ATOM 455 CA LEU A 57 41.965 23.389 49.967 1.00 27.17 AAAA 
ATOM 456 CB LEU A 57 42.077 24.063 48.596 1.00 26.62 AAAA 
ATOM 457 CG LEU A 57 42.491 25.545 48.656 1.00 27.64 AAAA 
ATOM 458 CD1 LEU A 57 42.770 26.108 47.269 1.00 26.66 AAAA 
ATOM 459 CD2 LEU A 57 41.389 26.341 49.349 1.00 26.92 AAAA 
ATOM 460 C LEU A 57 41.552 21.946 49.796 1.00 27.26 AAAA 
ATOM 461 O LEU A 57 40.363 21.612 49.816 1.00 27.53 AAAA 
ATOM 462 N LEU A 58 42.547 21.085 49.641 1.00 27.42 AAAA 



Figure 19-8 

ATOM 463 CA LEU A 58 42.293 19.675 49.457 
ATOM 464 CB LEU A 58 43.486 19.019 48.794 
ATOM 465 CG LEU A 58 43.623 19.577 47.385 
ATOM 466 CD1 LEU A 58 44.760 18.884 46.705 
ATOM 467 CD2 LEU A 58 42.334 19.355 46.600 
ATOM 468 C LEU A 58 41.938 18.956 50.731 
ATOM 469 O LEU A 58 41.648 17.763 50.692 
ATOM 470 N LEU A 59 41.977 19.666 51.858 
ATOM 471 CA LEU A 59 41.595 19.070 53.136 
ATOM 472 CB LEU A 59 41.958 19.991 54.322 
ATOM 473 CG LEU A 59 43.423 20.280 54.710 
ATOM 474 CD1 LEU A 59 43.502 21.461 55.652 
ATOM 475 CD2 LEU A 59 44.044 19.044 55.357 
ATOM 476 C LEU A 59 40.074 18.870 53.090 
ATOM 477 O LEU A 59 39.503 18.266 53.993 
ATOM 478 N PHE A 60 39.436 19.392 52.031 
ATOM 479 CA PHE A 60 37.983 19.276 51.823 
ATOM 480 CB PHE A 60 37.250 20.476 52.440 
ATOM 481 CG PHE A 60 35.778 20.534 52.098 
ATOM 482 CD1 PHE A 60 34.917 19.501 52.462 
ATOM 483 CD2 PHE A 60 35.249 21.628 51.399 
ATOM 484 CE1 PHE A 60 33.550 19.557 52.136 
ATOM 485 CE2 PHE A 60 33.890 21.688 51.071 
ATOM 486 CZ PHE A 60 33.042 20.652 51.440 
ATOM 487 C PHE A 60 37.557 19.139 50.345 
ATOM 488 O PHE A 60 36.846 18.201 49.974 
ATOM 489 N HIS A 61 37.982 20.079 49.511 
ATOM 490 CA HIS A 61 37.626 20.053 48.099 
ATOM 491 CB HIS A 61 37.768 21.449 47.494 
ATOM 492 CG HIS A 61 36.744 22.429 47.979 
ATOM 493 CD2 HIS A 61 35.429 22.559 47.683 
ATOM 494 ND1 HIS A 61 37.038 23.444 48.864 
ATOM 495 CE1 HIS A 61 35.952 24.159 49.089 
ATOM 496 NE2 HIS A 61 34.962 23.643 48.385 
ATOM 497 C HIS A 61 38.416 19.054 47.253 
ATOM 498 O HIS A 61 39.596 18.805 47.498 
ATOM 499 N THR A 62 37.754 18.496 46.244 
ATOM 500 CA THR A 62 38.369 17.522 45.333 
ATOM 501 CB THR A 62 37.290 16.695 44.614 
ATOM 502 OG1 THR A 62 36.544 17.541 43.731 
ATOM 503 CG2 THR A 62 36.334 16.094 45.629 
ATOM 504 C THR A 62 39.226 18.217 44.278 
ATOM 505 O THR A 62 38.876 19.286 43.792 
ATOM 506 N GLU A 63 40.344 17.606 43.912 
ATOM 507 CA GLU A 63 41.249 18.202 42.928 
ATOM 508 CB GLU A 63 42.333 17.219 42.536 
ATOM 509 CG GLU A 63 43.304 16.869 43.609 
ATOM 510 CD GLU A 63 44.427 16.022 43.052 
ATOM 511 OE1 GLU A 63 45.100 16.499 42.097 
ATOM 512 OE2 GLU A 63 44.619 14.892 43.564 
ATOM 513 C GLU A 63 40.607 18.687 41.639 
ATOM 514 O GLU A 63 40.824 19.816 41.215 
ATOM 515 N ASP A 64 39.845 17.814 40.998 
ATOM 516 CA ASP A 64 39.204 18.165 39.753 
ATOM 517 CB ASP A 64 38.301 17.018 39.295 
ATOM 518 CG ASP A 64 37.213 16.694 40.302 
ATOM 519 OD1 ASP A 64 36.375 15.801 40.027 
ATOM 520 OD2 ASP A 64 37.188 17.332 41.374 
ATOM 521 C ASP A 64 38.412 19.465 39.902 
ATOM 522 O ASP A 64 38.462 20.331 39.026 
ATOM 523 N TYR A 65 37.695 19.608 41.012 
ATOM 524 CA TYR A 65 36.918 20.814 41.248 
ATOM 525 CB TYR A 65 36.010 20.654 42.467 
ATOM 526 CG TYR A 65 35.339 21.946 42.866 
ATOM 527 CD1 TYR A 65 34.525 22.636 41.964 
ATOM 528 CE1 TYR A 65 33.914 23.823 42.308 



1.00 

1.00 

1.00 

1.00 

1.00 

1.00 

1.00 

1.00 

1.00 

1.00 

1.00 

1.00 

1.00 

1.00 

1.00 

1.00 

1.00 

1.00 

1.00 

1.00 

1.00 



00 
GO 
00 
00 
00 
00 
1.00 
1.00 
1.00 
1.00 
1.00 
1.00 
1.00 
1.00 
1.00 
1.00 



00 
00 
00 
00 
00 
00 
00 
00 
00 
00 
00 
,00 
,00 
,00 
1.00 
1.00 
1.00 
1.00 
1.00 
1.00 
1.00 
1.00 
1.00 
1.00 
1.00 
1.00 
1.00 
1.00 
1.00 



26.10 
25.43 
26.66 
27 .12 
26.43 
25.79 
26.50 

24 .91 

25 .15 
25.44 
24 .67 

23 .70 

24 .08 
25.41 
25.88 
25.05 
24.11 
21.80 
20.07 
19.27 
19.82 
19.26 
17.45 
17.92 
24.02 
23.27 
24.40 
25.04 
24.19 
24.44 
24.12 
24.36 
23.18 
23.91 
25.60 
26.94 
26.68 
28.17 
28.15 
28.10 
28.24 
29.28 
29.52 
31.33 
32.42 
34.37 
37.20 
38.79 
37.96 
39.68 
31.96 
32.10 
31.52 
31.36 
33.99 
37.38 
39.80 
38.67 
30.02 
30.47 
27 .51 
26.03 
25.42 
24.90 
25.04 
25.01 



AAAA 

AAAA 

AAAA 

AAAA 

AAAA 

AAAA 

AAAA 

AAAA 

AAAA 

AAAA 

AAAA 

AAAA 

AAAA 

AAAA 

AAAA 

AAAA 

AAAA 

AAAA 

AAAA 

AAAA 

AAAA 

AAAA 

AAAA 

AAAA 

AAAA 

AAAA 

AAAA 

AAAA 

AAAA 

AAAA 

AAAA 

AAAA 

AAAA 

AAAA 

AAAA 

AAAA 

AAAA 

AAAA 

AAAA 

AAAA 

AAAA 

AAAA 

AAAA 

AAAA 

AAAA 

AAAA 

AAAA 

AAAA 

AAAA 

AAAA 

AAAA 

AAAA 

AAAA 

AAAA 

AAAA 

AAAA 

AAAA 

AAAA 

AAAA 

AAAA 

AAAA 

AAAA 

AAAA 

AAAA 

AAAA 

.AAAA 




Figure 19-9

ATOM    529  CD2 TYR A  65
ATOM    530  CE2 TYR A  65
ATOM    531  CZ  TYR A  65
ATOM    532  OH  TYR A  65
ATOM    533  C   TYR A  65
ATOM    534  O   TYR A  65
ATOM    535  N   ILE A  66
ATOM    536  CA  ILE A  66
ATOM    537  CB  ILE A  66
ATOM    538  CG2 ILE A  66
ATOM    539  CG1 ILE A  66
ATOM    540  CD1 ILE A  66
ATOM    541  C   ILE A  66
ATOM    542  O   ILE A  66
ATOM    543  N   ASN A  67
ATOM    544  CA  ASN A  67
ATOM    545  CB  ASN A  67
ATOM    546  CG  ASN A  67
ATOM    547  OD1 ASN A  67
ATOM    548  ND2 ASN A  67
ATOM    549  C   ASN A  67
ATOM    550  O   ASN A  67
ATOM    551  N   THR A  68
ATOM    552  CA  THR A  68
ATOM    553  CB  THR A  68
ATOM    554  OG1 THR A  68
ATOM    555  CG2 THR A  68
ATOM    556  C   THR A  68
ATOM    557  O   THR A  68
ATOM    558  N   LEU A  69
ATOM    559  CA  LEU A  69
ATOM    560  CB  LEU A  69
ATOM    561  CG  LEU A  69
ATOM    562  CD1 LEU A  69
ATOM    563  CD2 LEU A  69
ATOM    564  C   LEU A  69
ATOM    565  O   LEU A  69
ATOM    566  N   MET A  70
ATOM    567  CA  MET A  70
ATOM    568  CB  MET A  70
ATOM    569  CG  MET A  70
ATOM    570  SD  MET A  70
ATOM    571  CE  MET A  70
ATOM    572  C   MET A  70
ATOM    573  O   MET A  70
ATOM    574  N   GLU A  71
ATOM    575  CA  GLU A  71
ATOM    576  CB  GLU A  71
ATOM    577  CG  GLU A  71
ATOM    578  CD  GLU A  71
ATOM    579  OE1 GLU A  71
ATOM    580  OE2 GLU A  71
ATOM    581  C   GLU A  71
ATOM    582  O   GLU A  71
ATOM    583  N   ALA A  72
ATOM    584  CA  ALA A  72
ATOM    585  CB  ALA A  72
ATOM    586  C   ALA A  72
ATOM    587  O   ALA A  72
ATOM    588  N   GLU A  73
ATOM    589  CA  GLU A  73
ATOM    590  CB  GLU A  73
ATOM    591  CG  GLU A  73
ATOM    592  CD  GLU A  73
ATOM    593  OE1 GLU A  73
ATOM    594  OE2 GLU A  73